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Preface 



The central challenge of theoretical computer science is to deploy mathematics in 
ways that serve the creation of useful algorithms. In recent years there has been a 
growing interest in the two-dimensional framework of parameterized complexity, 
where, in addition to the overall input size, one also considers a parameter, with 
a focus on how these two dimensions interact in problem complexity. 

This book presents the proceedings of the 1st International Workshop on Pa- 
rameterized and Exact Computation (IWPEC 2004, http://www.iwpec.org), 
which took place in Bergen, Norway, on September 14-16, 2004. The workshop 
was organized as part of ALGO 2004. There were seven previous workshops 
on the theory and applications of parameterized complexity. The first was or- 
ganized at the Institute for the Mathematical Sciences in Chennai, India, in 
September, 2000. The second was held at Dagstuhl Castle, Germany, in July, 
2001. In December, 2002, a workshop on parameterized complexity was held in 
conjunction with the FST-TCS meeting in Kanpur, India. A second Dagstuhl 
workshop on parameterized complexity was held in July, 2003. Another work- 
shop on the subject was held in Ottawa, Canada, in August, 2003, in conjunction 
with the WADS 2003 meeting. There have also been two Barbados workshops 
on applications of parameterized complexity. 

In response to the IWPEC 2004 call for papers, 47 papers were submitted, 
and from these the program committee selected 25 for presentation at the work- 
shop. In addition, invited lectures were accepted by the distinguished researchers 
Michael Langston and Gerhard Woeginger. 

This first instantiation of a biennial workshop series on the theory and appli- 
cations of parameterized complexity got its name in recognition of the overlap 
of the two research programs of parameterized complexity and worst-case expo- 
nential complexity analysis, which share the same formal framework, with an 
explicitly declared parameter of interest. There have been exciting synergies be- 
tween these two programs, and this first workshop in the IWPEC series attempts 
to bring these research communities together. 

The second workshop in this series is tentatively scheduled for the Gold Coast 
of Queensland, Australia, in July, 2006. An exact computation implementation 
challenge is being organized as a part of this second workshop. Details of the 
competition will be posted at http://www.iwpec.org. 

On behalf of the program committee, we would like to express our appreci- 
ation to the invited speakers and to all authors who submitted papers. We also 
thank the external referees who helped with the process. We thank the program 
committee for excellent and thoughtful analysis of the submissions, and the or- 
ganizers of ALGO 2004 in Bergen. We thank especially the tireless Frank Dehne 
for his efforts in almost all things relating to this conference and for co-editing 
these proceedings. 



Rod Downey and Mike Fellows, July 2004 
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Parameterized Enumeration, Transversals, and 
Imperfect Phylogeny Reconstruction* 



Peter Damaschke 



School of Computer Science and Engineering 
Chalmers University, 41296 Goteborg, Sweden 
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Abstract. We study parameterized enumeration problems where we are 
interested in all solutions of limited size, rather than just some minimum 
solution. In particular, we study the computation of the transversal hy- 
pergraph restricted to hyperedges with at most k elements. Then we ap- 
ply the results and techniques to almost-perfect phylogeny reconstruction 
in computational biology. We also derive certain concise descriptions of 
all vertex covers of size at most fc in a graph, within less than the trivial 
time bound. 



1 Introduction 

We suppose familiarity with the notion of fixed-parameter tractable (FPT) prob- 
lems, otherwise we refer to [8]. In many combinatorial optimization problems, 
one wants a particular solution where the parameter k is minimized. In the 
present paper we deal with the generation of all solutions with objective values 
bounded by parameter k. As a concrete application we study the reconstruction 
of almost perfect phytogenies. 

A perfect phylogeny (PP) is a tree with nodes labeled by bit vectors of length 
TO, and edges with labels from [to] = {!,... ,to} such that, for every i G [to], 
the vectors having 0 and 1, respectively, at position i are separated by exactly 
one edge labeled i (and hence form connected subtrees). This is a fundamental 
structure in computational biology, as it describes evolutionary trees where at 
most one mutation appeared at every position. Another application domain is 
linguistics [26]. Recently, PP attracted new attention as it supports haplotype 
inference.^ 

The bit vectors are usually represented as rows of an n x to matrix. The 
columns correspond to the positions, also called sites or loci. We speak of a PP 
matrix if there is a PP containing all its rows (and perhaps more bit vecors) 
as node labels. From a PP matrix one can uniquely reconstruct such a PP 
in 0{nm) time. (Here, uniqueness means: subject to isomorphism and to the 

* This work has been supported by a grant from the Swedish Research Council (Veten- 
skapsradet), file no. 621-2002-4574. 

^ It is quite impossible to cite all relevant papers here. The reader is referred to the 
proceedings of RECOMB 2002-2004, including satellite workshops. 
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ordering of edge labels on paths of degree-2 nodes.) Reconstruction can be done 
incrementally. Starting from an empty set of columns, add columns successively 
to the input and refine the PP. Details are not complicated, see e.g. [27, Section 
14.1]. One can generalize the notion of PP to non-binary cases, complexity results 
are in [1,2,19,25]. 

However, the PP assumption is often too strict. Repeated mutations at some 
loci, or recent immigration into a PP population leads to deviations from PP. Se- 
quencing errors are also common, hence corrupted data may lose the PP property 
even if the true data would form a PP. Thus one should allow a small number k 
of changes, i.e. bit flips in the matrix, or extra rows or columns, or combinations 
of them. This motivates a few computational problems: 

PP PLUS k ROWS: Given a binary matrix, find all sets of at most k rows the 
deletion of which leaves a PP matrix. 

PP PLUS k COLUMNS: Similarly defined. 

PP WITH k ERRORS: Given a binary matrix, find all sets of k bit flips such that 
the resulting matrix has a PP. 

Enumerating all solutions captures the applications better than the min- 
imization problem. There is no reason to assume that the smallest number of 
changes is always the correct explanation of data. Rather we want an overview of 
all consistent solutions, for at most k changes, and we also wish to reconstruct 
the part of the PP (i.e. without some rows or columns) common to all these 
conceivable solutions, the maximum agreement structure so to speak. Another 
phylogeny reconstruction problem has been studied in [14] from this perspective, 
see also [11] for more discussion of the importance of enumeration. 

More generally (but a bit vaguely perhaps) it can be said that parameterized 
enumeration is suitable when we want to recognize certain objects from given 
data which do not perfectly fit the expected structure. Then all potential solu- 
tions are required for further inspection. Applications besides phylogeny may be 
found e.g. in data mining. 

We will use a well-known characterization of PP matrices. A pair of columns 
is called complete if each of 00, 01, 10, 11 appears as a row in the submatrix 
induced by these two sites. Throughout the paper we refer to 00, 01, 10, 11 as 
combinations. The following has been discovered several times, see e.g. [15,28]. 

Theorem 1. A matrix is a PP matrix iff it does not contain complete pairs. □ 

This connects almost-PP reconstruction to the more abstract class of subset 
minimization problems: Given a set of n elements, a property tt of subsets, and 
some k, we want all minimal subsets of size at most k enjoying tt. Note carefully 
that the term minimal refers to set inclusion, not cardinality! We say that tt is 
closed under D if every Y Z) X has property tt whenever X has. Examples are 
vertex covers in graphs and hitting sets of set families (hypergraphs) . For such tt 
it suffices to know the minimal solutions, as they “represent” all solutions. This 
motivates the following 
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Definition 1. Given a subset minimization problem, a full kernel is a set whose 
size depends on k only and contains all minimal solutions of size at most k. 

We call a problem inclusion-minimally fixed parameter enumerable (IMFPE) 
if, for any instance of size n, all minimal solutions with value at most k are 
computable in time 0{f{k)p{n)) where p is polynomial and f any function. 

Once we have a full kernel then, trivially, we can also enumerate the mini- 
mal solutions in time depending on k only, hence a problem is IMFPE in this 
case. It is crucial to notice the seemingly little but important difference to the 
optimally/minimally fixed parameter enumerable (MFPE) problems in [11]. To 
avoid confusion with minimum size, we added the attribute “inclusion-” . 

The family of all minimal hitting sets to a given set family is known as 
the transversal hypergraph. Applications include models of boolean formulae, 
database design, diagnosis, and data mining. Known algorithms for generating 
the transversal include a pseudo-polynomial output-sensitive algorithm [12], al- 
gorithms for special hypergraph classes [3], and a practical heuristic based on 
simple but powerful ideas [20] . Here we are interested in the “pruned” transversal 
hyper graph consisting of the minimal hitting sets of size at most k. Apparently, 
generation of hitting sets by ascending size has not been addressed before, unlike 
e.g. lexicographic ordering [18]. 

Contributions and organization of the paper: In Section 2 we obtain 
IMPFE results for subset minimization probnlems. In Section 3 we apply these 
findings to almost-PP reconstruction. In Section 4 we give an algorithm that 
outputs a certain concise description of all small vertex covers of a graph within 
less than the trivial time bound. Due to limited space, we defer the detailed 
exposition of results to these sections, and we could only sketch most proofs and 
only convey the main ideas. 

We believe that the notions of IMFPE and a full kernel are more significant 
than some technical results which are built very much upon known research. (In 
particular, our results in Section 2 are close to [11], however, the new aspect is 
that the bounds still hold in the more demanding IMFPE setting.) The IMPFE 
concept is strict and sets limits to what clever algorithms could achieve, but as 
argued above, it seems to reflect the goals in certain applications well. Our focus 
is on theoretical results. Some experiments regarding the real performance on 
data of reasonable size would complete the picture. A handful open problems 
arise from the text. 

More related literature: Recently, almost-PP reconstruction has also been 
studied in [28] in a more general frame (destroying all occurences of a given small 
submatrix), however without time bounds for enumeration. Results in [10] are 
based on a different, distance-based imperfection measure. The viewpoint in [26] 
is more similar to ours, but the focus was different, and exhaustive search is used 
for PP with extra columns. Various computational biology problems allow FPT 
results, see e.g. [9,13,14]. Closely related to error correction in PP matrices is 
reconstruction of PP from incomplete matrices [25,16]). It might be interesting to 
look at this NP-hard from the FPT point of view. Papers [23,24] contain results 
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on directed PP reconstruction with missing entries. We mentioned maximum 
agreement problems (e.g. [17] gives an overview). Usually they have as input 
an arbitrary set of structures, rather than slight variants of one structure. In 
[6] we proposed a simple PP haplotyping algorithm for instances with enough 
genotypes, and the ideas in the present paper may lead to extensions to almost- 
PP populations. 

2 Hitting All Small Hitting Sets 

The Vertex Cover problem is FPT [5,21]: Given a graph G = (V,E) with n 
vertices and m edges, and a number k, find a fc-vertex cover, i.e. a set of at most 
k vertices that is incident to every edge. A full kernel for Vertex Cover is any 
subset of V that entirely contains all minimal fc-vertex covers in G. 

Lemma 1. Vertex Cover has a full kernel of size (1 + o(l))fc^. It can be 
constructed in 0{m) time. 

Proof. We show that the kernel from [4] is also a full kernel: Every k-vertex cover 
in G must contain the set H of vertices of degree larger than k. If we remove the 
vertices of H, all incident edges, and all vertices that had neighbors in H only, 
the remaining subgraph R has at most k"^ edges (or there is no solution at all), 
and hence less than 2/c^ vertices. Every minimal /c-vertex cover is the union of 
H and some minimal vertex cover of R. Thus, H U R is a full kernel. Factor 2 
can be improved to 1 + o(l) by more careful counting. (Omitted due to lack of 
space.) □ 

Remarks: 

(1) For the optimization version of Vertex Cover there exist kernels of 

size 2k [5], but 0{k^) is already the optimal worst-case bound for full kernels: 
In the disjoint union of m stars (one central vertex, joined to m leaves), 

the leaves of any star and the centers of all other stars build a fc-vertex cover, 
k = 2m — 1. Hence the full kernel has size about /c^/4. The optimal constant in 
0{k^) remains open. 

(2) It was crucial to restrict the full kernel to minimal vertex covers. If we 
dropped the minmality condition, the size would not even be bounded by any 
function of k. A simple example is the star Ki^n-i and k = 2: The center plus 
any leaf pair is a solution, and their union has size n. But the full kernel (merely 
the center) has size 1. 

In order to enumerate all fc-vertex covers we may construct the full kernel 
as and then apply the bounded search tree technique. Note that we distinguish 
nodes of the search tree from vertices of the graph. 

Theorem 2. Vertex Cover is IMFPE. All minimal solutions of size at most 
k can be enumerated in 0{m + fc^2^) time. 
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Proof. List all edges in the full kernel. Put a vertex from the first edge uv in 
the solution and branch for every choice (u or v). Label every new node by the 
vertex just selected. At any node proceed as follows: If some vertex in the edge 
listed next has already been chosen (i.e. it appears on the path from the root 
to the current node) , then skip this edge. Repeat this step until the condition is 
false. Else, select a vertex from the next edge and branch. 

Since this adds a new vertex to the solution on the considered tree path, but 
at most k vertices can be selected, the search tree has depth at most k, and at 
most 2^ leaves. Since every inner node has at least two children, the total size 
is 0(2^). Finally we prune the tree, that is, successively remove all leaves where 
the edge list has not been scanned completely. From the search tree we can read 
off all fc-vertex covers, as they are the label sets of paths from the root to the 
leaves. At every node we checked for every edge whether some of its vertices is 
already on the path. This gives immediately the time bound 0{k^2^). Pruning 
costs 0(2^) time. 

One easiliy verifies that any minimal vertex cover X appears, in fact, as some 
path in the search tree. 

Finally we also cut away leaves with non-minimal solutions X as follows. For 
every vertex in X, check whether all its neighbors are in X as well. Due to the 
degree bound in the kernel, this needs 0{k^2^) time. □ 

Hitting Set: Given a hypergraph G with n vertices and h hyperedges (subsets 
of vertices), and a number k, find a set of at most k vertices that hits every 
hyperedge. 

In c-Hitting Set, the cardinality of hyperedges is bounded by c, hence 
c = 2 is Vertex Cover. For recent results on c > 3 see [22]. Next we study the 
enumeration version of an even more general problem. By a multiedge we mean 
a family of at most c disjoint sets. We omit c if it is clear from context. The 
following problem statement needed in 3.1 is quite natural as such and may be 
of independent interest, however we are not aware of earlier mention of it. 

Bounded Union: Given h multiedges, i.e. families of at most c disjoint sets, 
each with at most d vertices, find a subset U of at most k vertices, that entirely 
includes at least one set from each multiedge. In other words, find a union of 
sets, one from each multiedge, with size bounded by k. 

We say that U settles a multiedge {S'!, . . . , Sc} ii Si C U for some i. Thus, 
a solution to Bounded Union must settle all multiedges. Note that Hitting 
Set is the special case when d = 1. On the other hand, Bounded Union is 
trivially reducible to Hitting Set: Replace every multiedge {S'!, . . . , Sc} with 
the collection of all jS'ij x . . . x jS'd hyperedges {si, . . . , Sc} such that Si G Si for 
i = 1, . . . ,c. Now, a set U hits all these hyperedges iff U settles the multiedge. 
It follows that this reduction also preserves all solutions. However, it blows up 
the input size by factor 0{d‘^). Thus, one better works directly on instances of 
Bounded Union, without the detour via this reduction. 
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Theorem 3. Bounded Union is IMFPE. All minimal solutions can be found 
in 0{dc^~^^h + mm{kc^'^,hkc’‘}) time. 

Proof. Again, we construct a bounded search tree, but now on the whole in- 
stance. List the given multiedges. Select a set from the first multiedge and branch 
for every choice. At any node proceed as follows: If the multiedge listed next is 
already settled by the union of previously selected sets on the tree path, then 
skip it. Repeat this step until the condition is false. Else, select a set from the 
next multiedge and branch. Since this adds at least one new element to the 
union, the search tree has depth at most k, at most leaves, and O(c^) nodes 
in total. From the search tree we can read off all unions: In any path from the 
root to a leaf, collect the sets specified by the path. Completeness of the solution 
space can be easily established. As for the time bound, note that on each path, 
every multiedge is processed only once in 0{cd) time. 

A naive method for filtering the non-minimal solutions is pairwise comparison 
in 0{k(?^) time. Testing the minimality of every solution X is faster \i h < . 

Proceed as follows. For every multiedge e, list the vertices of X contained in 
e. If exactly one set S' of e satisfies S C X, then the vertices in S are not 
redundant. Mark all non-redundant vertices found that way. First suppose that 
all multiedges are already settled by these marked vertices. In this case, X is 
non-minimal iff X contains further, unmarked vertices. This check needs 0{hk) 
time. The other case is that some multiedges are not yet settled by the marked 
vertices. But since A is a solution, we conclude two things: (1) Not all vertices 
in X are marked. (2) For every multiedge, either one set consists of marked 
vertices only, or at least two sets are completely in X. Hence, we can remove 
an unmarked vertex from X, and still some set of every multiedge is in A. This 
means, A is not minimal, and we do not need further tests. □ 

We can show that a smaller full kernel exists in case k > c, thus generalizing 
a result from [22] . 

Theorem 4. For any instance o/ H itting Set or Bounded Union, an equiva- 
lent instance with no more than k'^ hyperedges can he obtained in time 0{ck'^~^h). 
Consequently, both problems have a full kernel of size ck‘^. 

Proof. First we count how often every vertex appears in the hyperedges, in 
0{cdh) time, going through the h hyperedges or multiedges. (For an instance of 
Bounded Union, there is no need to perform the reduction to Hitting Set 
explicitly, as we know the cardinalities of sets in the multiedges.) 

Suppose that each vertex appears in at most k'^~^ hyperedges. Then, a set 
of size k can hit at most k'^ hyperedges. If there is a solution at all, the instance 
contains only that many hyperedges, with a total of fc -I- (c — l)k‘^ vertices, and 
we are done. Otherwise we select a vertex and k‘^~^ 1 hyperedges containing 

it. 

Suppose by induction that we have found a set C of size i, and a family Fli 
of -I- 1 hyperedges with C as subset. Either (1) some C U {y}, y ^ C is in 
at least fc'=-(*+i) -|- 1 hyperedges of Hi, or (2) k distinct vertices y ^ C are not 
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enough to hit all hyperedges of Hi. In case (1), the induction hypothesis holds 
for z + 1. In case (2), each hitting set of size k must also hit C. But then we can 
create a hyperedge C and delete supersets of C in Hi from the instance, without 
altering the solution space. 

This case distinction can be decided in 0((c — z)fc'^“®) time, since it suffices 
to consider all y from the union of members of Hi. We find the hyperedges in 
Hi that are to be deleted within the same time. If case (I) still holds for i = c, 
we have two copies of the same hyperedge and can also delete one. Altogether, 
we reduced the number of hyperedges, in time. 

The procedure is repeated less than h times. The vertex counters can be 
updated in time proportional to cd times the number of deleted hyperedges, 
which is a total of 0{cdh). Finally note that d < k can be assumed. □ 

Combining the two results, we improve the coefficient of h from Theorem 3, 
provided that k > c: 

Corollary 1. All minimal solutions of Bounded Union can be computed in 
0{ck‘^~^h + dc^^^H + time. 

Proof. Construct an instance that has the same solutions but at most (rather 
than h) hyperedges, as in Theorem 4, then run the algorithm from Theorem 3 
on it. □ 

3 Imperfect Phylogeny Reconstruction 

3.1 Extra Rows 

If an instance of PP plus k ROWS has a solution at all, then, in any complete 
pair, one of 00, 01, 10, 11 appears in at most k rows. At most 3 of these com- 
binations appear at most k rows, unless k > n/4. In the following we implicitly 
assume n > 4k, remember that /c is a fixed parameter. Destroying the complete 
pair means to remove all rows that contain one of 00, 01, 10, 11. This reduces 
PP PLUS k ROWS to Bounded Union: The rows of the matrix are elements of 
the ground set, and for every complete pair of columns i,j we define a multiedge 
whose sets are the sets of rows containing 00, 01, 10, 11, respectively, at sites i,j. 
Trivially, it is enough to keep sets of at most k rows. This gives h < (™) < m^, 
c = 3, and d = k. 

Before we state our theorem, we discuss a naive application of the Bounded 
Union results: Construct the multiedges from the matrix, then solve this in- 
stance of Bounded Union in 0{k^mf + S’^k^) time (Corollary 1). To this end 
we may check all 0{m?) column pairs for completeness. But, unfortunately, for 
each pair we have to look at almost all rows, thus preprocessing needs 0{nmf) 
extra time. We get 0{nmf + ?>^k^) and lose the benefits of a small kernel. By 
an idea mentioned in [16], the complete pairs of an n x m matrix can be found 
already in time, where 0{n‘^) is a bound for matrix multiplication. 

But still, the dependency in m is not linear. We omit any details, because the 
following time bound is anyhow an improvement, unless 3^ > 
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Theorem 5. PP plus k ROWS is IMFPE. All minimal solutions can be com- 
puted in time. We ean also identify at most rows such that the PP 

without these rows can he correctly reconstructed within this time hound. 

Proof. First find some complete pair in 0{nm) time as follows: Construct a PP 
incrementally, until a failure occurs at column j, say. Column j together with 
some column i < j forms a complete pair. To find such i, go through the columns 
i < j again. The complete pair i,j contains at most three of 00, 01, 10, 11 at 
most k times. Label the affected rows in the matrix, and branch if there are two 
or three such combinations. Continue on the matrix without the selected rows. 
In order to avoid copying the whole matrix for use on the different tree paths, we 
can work on the original matrix and observe the row labels. This way we spend 
only 0{nm) time at each of the 0(3^) nodes. The second assertion follows from 
Theorem 4: All minimal solutions are in a full kernel of size 3fc^. □ 



3.2 Extra Columns 

Theorem 1 immediately reduces PP plus k COLUMNS to Vertex Cover: The 
reduction graph RG has the columns as vertices and the complete pairs as edges. 
This reduction costs naively 0{nmf) time, or 0{nmP~^) time using the idea from 
[16] mentioned above. But we can do better, unless kf > 

Theorem 6. PP plus k columns is IMFPE. All minimal solutions can be 
computed in 0{k^nm + time. We can aslo identify at most 2k^ columns 

and correctly reconstruct the PP without these sites in 0{k^nm) time. 

Proof. Build a PP incrementally, adding column by column. When we get stuck, 
we know a column y that is in some complete pair. Find all complete pairs with 
y, in the obvious way in 0{nm) time. If these are more than k pairs, y belongs 
to every solution and can be discarded. Clearly, this case can appear at most k 
times in the course of the algorithm. If y has degree at most k, we next consider 
the neighbors of y in the graph RG, we determine their neighbors as above, 
etc. Thus we construct some connected subgraph of RG by breadth-first-search 
(BFS). Recall that at most k'^ edges are incident to vertices of degree at most 
k, if there is a solution at all. If the BFS stops, we discard all columns reached 
by BFS and repeat the whole procedure, as there could exist other connected 
subgraphs of low-degree vertices. Thus, we eventually construct the full kernel 
in 0{k^nm) time. Then, use Theorem 2 to produce all minimal vertex covers of 
the kernel in 0{kf2^) time. □ 

Theorem 6 exploited only the fact that a few columns do not fit in a PP. 
These “bad” columns could look arbitrarily. It arises the question whether we 
can take advantage of additional assumptions, if we know the way the matrix 
has been produced (e.g. by random mutations). 
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3.3 A Full Kernel for Error Correction 

Errors in data are more delicate than missing entries. Changing k bits in a 
PP matrix corrupts at most k rows/colummns. We would like to find all PP 
correction sets, i.e. combinations of at most k bits whose change gives a PP 
matrix. This subset property is no longer closed under D . However we may aim 
at finding all minimal PP correction sets of size at most k. In this formulation, 
the problem has a full kernel, which is perhaps not obvious. Our bound is pretty 
high, but we expect a much better “real” complexity for typical instances. 

Theorem 7. PP with k ERRORS has a full kernel o/3fc(fc+l)^6^ matrix entries 
which can be determined in 0{k&^nm) time. 

Proof. Rather long, omitted due to lack of space. 

4 All fc-Vertex Covers in o{2^poly{ri)) Time 

Base 2 in Theorem 2 cannot be improved in general, because there are graphs 
with 2^ minimal vertex covers of size k, for instance k disjoint edges. Hence 0(2^) 
is, subject to a polynomial factor, the optimal time bound for enumerating all 
minimal fc-vertex covers. On the other hand, the solution space for such a trivial 
graph is the Cartesian product of these edges. It can be described in just this 
way and, if desired, explicitly enumerated afterwards, with small delay. 

This observation motivates a modification of the problem: Compute a de- 
scription of the solution space from which all solutions can be enumerated with 
limited delay. (It is straightforward to give formal definitions.) Such a descrip- 
tion would typically consist of a union of (fewer than 2^) simple parts of the 
solution space. Note that this goal is different from enumerations with e.g. poly- 
nomial delay. Also, we measure the worst-case complexity in terms of parameter 
k rather than output size, i.e. we do not necessarily get output-sensitivity. 

In this section we derive a result of this type for fc- Vertex Cover, by 
a search tree algorithm that makes decisions for vertices rather than edges. 
As usual in graph theory, N(v) denotes the set of neighbors of vertex v, and 
N[v] = N (v) U {t}. For a graph G = {V, E) and a set V QV , G — Y is the graph 
after removal of Y and all incident edges. Our algorithm needs a more general 
problem statement, and (^-Vertex Cover is the special case when Y = %: 

Minimal Vertex Cover Extension (MVCE): 

Given a graph G = {V, E) and a set Y dV , find all sets X enjoying these two 
properties: (1) X D Y, and (2) A is a minimal vertex cover in G. Let fc-MVCE 
be the same problem with the additional demand (3) \X\ < k. 

It is NP-hard to decide whether an instance of MVCE has a solution [3]. A 
vertex in X is said to be redundant in X if all its neighbors are in X, too. 

Our algorithm for fc-MVCE sets up a search tree with the given G, k, and 
Y at the root. By t{k) we will denote the number of leaves of the decision tree, 
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where argument k is the parameter at the root, i.e. the number of vertices that 
may still be added to Y.^ 

At every tree node we first check if |K| > fc, or if some vertex in Y is redundant 
in Y. In these cases no solution X ^Y can exist, and the tree node becomes a 
dead end. Otherwise we choose some v ^Y and create two children of the node. 
In one branch we add v to Y and set k' := k — 1. In the other branch we decide 
that V ^ Y, which has some immediate implications: We put all neighbors of v in 
Y (they have to be chosen anyway) and set k' := k — |fV(u)|. Then we diminish 
the graph by removing N[v] and all incident edges, and we set Y' := Y \ N[v]. 
This routine is correct, since the solutions to /c-MVCE with v ^ Y correspond 
exactly to the solutions to fc'-MVCE in G — fV[a;] (with Y' instead of Y). 

Specifically, let v in any tree node be a vertex with maximum number of 
neighbors outside Y. Let us run this algorithm until no more vertex v (jiY with 
at least two neighbors outside Y exists, in any tree node. For all current leaves 
of the search tree this means that G — Y merely consists of isolated vertices and 
edges. Isolated vertices v in G — Y cannot be added to Y anymore (otherwise v 
would be redundant). 

If we aborted the algorithm at this moment, the number of leaves t{k) would 
be bounded by the recurrence t{k) < t{k — 1) + t{k — 2). Remember that we 
always added either v or its two (or more) neighbors to Y. The solution would 
be t{k) = 0{(j)^), with (j) = (-\/5 + l)/2 < 1.62. The difficulty is the presence 
of yet unresolved isolated edges in G — E. We have to add exactly one vertex 
from every isolated edge to Y, but in general not every choice gives a minimal 
solution to MVCE, so that a concise description is not obtained yet. To make 
things worse, since any v has only one neighbor in G — E, also the recurrence 
would no longer apply to further branches. Therefore we cannot prove an 0{(j>^) 
bound. However, a refinement of both the algorithm and the analysis will still 
result in a base y <2. 

Theorem 8. A description of all minimal vertex covers of size at most k, using 
only Cartesian products of edges and set union as set operations, can he computed 
in 0{y^p{n)) time, where y ~ 1.8, and p is some polynomial. 

Proof. We assign a height to every tree node as follows. For a constant r to be 
specified later, the root gets height {l + r)k. Whenever children of a tree node are 
created, both inherit the height from their parent. Then, the child with v G Y 
reduces its height by 1, for having added u to E. The child with v ^ Y reduces 
its height by 1 for each neighbor just added to E, and by r for each of the |IV(f)| 
vertices of E removed from the graph. Since every vertex is added to E and 
removed from the graph at most once along a tree path, and |E| < k, the height 
never becomes negative. 

In a first phase we do the search tree construction as above, until every G — Y 
is composed of isolated edges. The height is reduced by 1 in case u G E, and by 

^ We will spend polynomial time at each tree node, hence the time bound is t{k)p{n) 
for some polynomial p. Here, our analysis will focus only on the more interesting 
t{k) factor. 
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at least 2 + 2r in case v ^ Y , since at least two neighbors have been put in Y 
and then removed from the graph. 

In a second phase we continue as follows, at every leaf of the search tree. 
If no vertex in G — Y has neighbors in Y , we can already concisely describe 
the minimal vertex covers X ^ Y: Every solution X is the union of Y and 
exactly one vertex from every isolated edge, and these vertices can be chosen 
independently. In this case we stop at that leaf and output the description of 
this part of the solution space. (But if |E| plus the number of isolated edges 
exceeds k, we have a dead end.) 

The other case is that some vertex outside Y still has some neighbor in Y. 
We choose such a vertex as v. In the branch for v G Y we know that f’s unique 
neighbor u outside Y cannot be added to Y anymore. Thus we apply the rule 
for M ^ y (remove N[u] etc.) which reduces the height by 1 + r, since v has just 
been added to Y and then removed from the graph. In the branch for v ^ Y we 
remove at least two members of Y from the graph, due to the choice of v. This 
reduces the height by 1 + 2r, including summand 1 for having added u to Y. 

Recurrence t{k) < t{k — 1) + t{k — 2 — 2r) applies to the first phase, and 
t{k) < t{k — 1 — r) + t{k — 1 — 2r) to the second. The characteristic equation 
is + 1 and = a:’’ + 1, respectively. Define y = a;^’*''’, s = 

r/(l + r), and z = y®. Note that y is the base we obtain, because the tree size is 
0(t((l + r)k)). Since k = 0 leads to no branching, we can set t(0) = 1. 

Rewrite the characteristic equations as y^ = yz + 1 and yz = z + 1, re- 
spectively. Since 0 < s < 1, our z must also satisfy 1 < z < y. Substituting 
z = l/(y— 1) in the first equation, we obtain y^ — y^ = 2y— 1, hence y « 1.8. □ 

The approach can be extended far beyond this result. We briefly mention 
one possible refinement. Let us accept isolated paths and cycles in the subgraphs 
G — Y. (Minimal vertex covers of paths and cycles are still easy to enumerate.) 
Then we only have to take vertices v with at least 3 neighbors outside Y in the 
first phase. Following the lines of the previous proof, we can get y « 1.39. 

A nice special case of the problem is bipartite graphs: If vertices v in the 
algorithm are restricted to one partite set, MVCE has always a solution, hence 
dead ends can appear only because the size bound k is exceeded. However the 
corresponding minimization problem is NP-hard, which extends a result from 
[3]. (We have to omit the proof here.) 
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Abstract. We explore the effects of using graph width metrics as re- 
strictions on the input to online problems. It seems natural to suppose 
that, for graphs having some form of bounded width, good online algo- 
rithms may exist for a number of natural problems. In the work presented 
we concentrate on online graph coloring problems, where we restrict the 
allowed input to instances having some form of bounded pathwidth. We 
also consider the effects of restricting the presentation of the input to 
some form of bounded width decomposition or layout. A consequence of 
our work is the clarification of a new parameter for graphs, persistence, 
which arises naturally in the online setting, and is of interest in its own 
right. 



1 Introduction 

The last 20 years has seen a revolution in the development of graph algorithms. 
This revolution has been driven by the systematic use of ideas from topological 
graph theory, with the use of graph width metrics emerging as a fundamen- 
tal paradigm in such investigations. The role of graph width metrics, such as 
treewidth, pathwidth, and cliquewidth, is now seen as central in both algorithm 
design and the delineation of what is algorithmically possible. In turn, these ad- 
vances cause us to focus upon the “shape” , or “the inductive nature of” , much 
real life data. Indeed, for many real life situations, worst case, or even average 
case, analysis no longer seems appropriate, since the data is known to have a 
highly regular form, especially when considered from the parameterized point of 
view. 

It turns out, however, that the classic algorithms generated through the use 
of width metrics usually rely upon dynamic programming, and so are highly 
unsuited to the focus of this paper, the online situation. 

The usual theoretic model for online problems has the input data presented 
to the algorithm in small units, one unit per timestep. The algorithm produces a 
string of outputs: after seeing t units of input, it needs to produce the t-th unit of 
output. Thus, the algorithm makes a decision based only on partial information 
about the whole input string, namely the part that has been read so far. How 
good the decision of the algorithm is at any given step t may depend on the 
future inputs, inputs that the algorithm has not yet seen. 
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Real-life online situations often give rise to input patterns that seem to be 
“long” and “narrow”, that is, pathlike. For instance, consider the online schedul- 
ing of some large collection of tasks onto a small number of processors. One 
might reasonably expect a pattern of precedence constraints that gives rise to 
only small clusters of interdependent tasks, with each cluster presented more or 
less contiguously. Alternatively, one might expect a pattern of precedence con- 
straints giving rise to just a few long chains of tasks, with only a small number 
of dependencies between chains, where each chain is presented more or less in 
order. One could argue that the most compelling reason for attempting to solve 
a problem online is that the end of the input is “too long coming” according to 
some criteria that we have. Given such a situation, we attempt to do the best 
we can with the partial information available at each timestep. 

The goal of this paper is to introduce a new program in which we plan to 
apply ideas from topological graph theory to online situations. In particular, 
we intend to study online width metrics. It is now commonplace, in the offline 
setting, to find that by restricting some width parameter for the input graphs, a 
particular graph problem can be solved efficiently. A number of different graph 
width metrics naturally arise in this context which restrict the inherent complex- 
ity of a graph in various senses. The central idea is that a useful width metric 
should admit efficient algorithms for many (generally) intractable problems on 
the class of graphs for which the width is small. One of the most successful 
measures in this context is the notion of treewidth which arose from the seminal 
work of Robertson and Seymour on graph minors and immersions [16] . Treewidth 
measures, in a precisely defined way, how “tree-like” a graph is. The idea here is 
that we can lift many results from trees to graphs that are “tree-like” . Related 
to treewidth is the notion of pathwidth which measures, in the same way, how 
“path- like” a graph is. 

Our long-term goals are two-fold. Firstly, we seek to understand the effects of 
using width metrics as restrictions on the input to online problems. As mentioned 
above, online situations often give rise to input patterns that seem to naturally 
conform to restricted width metrics, in particular to bounded pathwidth. We 
might expect to obtain online algorithms having good performance, for various 
online problems, where we restrict the allowed input to instances having some 
form of bounded pathwidth. 

Secondly, we seek to understand the effects of restricting the presentation of 
the input to some form of bounded width decomposition or layout. The method 
of presentation of the input structure to an algorithm has a marked effect on 
performance. Indeed, this observation underpins the study of online algorithms 
in the first place. 

To lay the foundations of the program described here, we concentrate on 
online graph coloring. There has been a considerable amount of work done on 
online graph coloring, much of it related to earlier work on bin packing. However, 
approaches similar to the one that we take are notably absent from the literature. 

We remark that one consequence of our study to date has been the clarifica- 
tion of a parameter, persistence, which seems completely natural in the online 
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situation, and is of interest in its own right. We are sure that other new param- 
eters will come to light. We see the present paper as laying the foundations for 
our ideas. Future studies will be concerned both with other applications, and 
with making foundations for online descriptive complexity. 

2 Preliminaries 

2.1 Treewidth and Pathwidth 

Many generally intractable problems become tractable for the class of graphs 
that have bounded treewidth or bounded pathwidth. Furthermore, treewidth and 
pathwidth subsume many graph properties that have been previously mooted, in 
the sense that tractability for bounded treewidth or bounded pathwidth implies 
tractability for many other well-studied classes of graphs. For example, planar 
graphs with radius k have treewidth at most 3fc, series parallel multigraphs have 
treewidth 2, chordal graphs (graphs having no induced cycles of length 4 or 
more) with maximum clique size k have treewidth at most k — 1, graphs with 
bandwidth at most k have pathwidth at most k. 

A graph G has treewidth at most k if we can associate a tree T with G in 
which each node represents a subgraph of G having at most k + 1 vertices, such 
that all vertices and edges of G are represented in at least one of the nodes of T, 
and for each vertex v in G, the nodes of T where v is represented form a subtree 
of T. Such a tree is called a tree decomposition of G, of width k. We give a formal 
definition here: 

Definition 1. [Tree decomposition and Treewidth] 

Let G = {V, E) be a graph. A tree decomposition of G is a pair (T,X) where 
T = (/, F) is a tree, and X = {Xi | i € /} is a family of subsets of V, one for 
each node of T, such that 

1 - = 

2. for every edge {u, w} G E, there is an i G I with v G Xi and w G Xi, 

3. for all i,j, k G I, if j is on the path from i to k in T, then Xi H Xk C Xj. 

The treewidth or width of a tree decomposition ((/, A), | i S /}) is 

maxi^i \Xi\ — 1. The treewidth of a graph G is the minimum width over all 
possible tree decompositions of G. 

Definition 2. [Path decomposition and Pathwidth] 

A path decomposition of a graph G is a tree decomposition {P,X) of G where P 
is simply a path (i.e. the nodes of P have degree at most two). The pathwidth of 
G is the minimum width over all possible path decompositions of G. 

2.2 Competitiveness 

We measure the performance of an online algorithm, or gauge the difficulty of 
an online problem, using the concept of competitiveness, originally defined by 
Sleator and Tarjan [17]. 
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Suppose that P is an online problem, and A is an online algorithm for P. 
Let c > 1 be a constant. We say that A is c-competitive if, for any instance I 
of problem P, costA{I) < c • costopt(I) + b, where opt is an optimal offline algo- 
rithm that sees all information about the input in advance, and 6 is a constant 
independent of I. If an algorithm A is c-competitive, then we say that A has a 
performance ratio of c. We say that a given online problem P is c-competitive if 
there exists a c-competitive algorithm for P, and we say that it is no better than 
c-competitive if there exists no c'-competitive algorithm for P for any c' < c. 

3 Online Presentations 

The usual definition of an online presentation of a graph G is a structure 
G^ = (V,E,<) where < is a linear ordering of V. G is presented one vertex 
per timestep, v\ at time 1, U 2 at time 2, . . . and so on. At each step, the edges 
incident with the newly introduced vertex and those vertices already “visible” 
are also presented. We use the terms online presentation and online graph inter- 
changeably. 

Let Vi = {vj \ j < i} and Gf = G^\Vi], the online subgraph of G< induced 
by Vi- An algorithm that solves some online problem on G will make a decision 
regarding Vi (and/or edges incident with Vi) using only information about Gf. 

In this paper we introduce a different method of presenting a graphlike struc- 
ture online. First, fix some arbitrary constant (parameter) k. At each timestep 
we present one new active vertex that may be incident with at most k active 
vertices previously presented. Once a vertex has been presented we may render 
some of the current set of active vertices inactive in preparation for the intro- 
duction of the next new vertex. At no point do we allow more than fc -|- 1 active 
vertices, and we do not allow a new vertex to be incident with any inactive 
vertex. 




Fig. 1. Online presentation of a pathwidth 3 graph using 4 active vertices. Vertex 
V remains active for 3 timesteps. 
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These requirements mean that any graph presented in this fashion must 
have bounded pathwidth (pathwidth k). We are, in effect, presenting the graph 
as a path decomposition, one node per timestep. We denote such an online 
presentation of a graph G as path fc 

We can add the further requirement that any vertex may remain active for at 
most I timesteps, for some arbitrary constant (parameter) 1. We say that a path 
decomposition of width k, in which every vertex of the underlying graph belongs 
to at most I nodes of the path, has pathwidth k and persistence I, and say that 
a graph that admits such a decomposition has bounded persistence pathwidth. 
We explore this natural notion in Section 6, but we remark here that we believe 
that it truly captures the intuition behind the notion of pathwidth. 

An online graph that can be presented in the form of a path decomposition 
with both low width and low persistence is properly pathlike, whereas graphs 
that have high persistence are, in some sense, “unnatural” or pathological. Con- 
sider the graph G presented in Figure 2. G is not really path-like, but still has 
a path decomposition of width only two. The reason for this is reflected in the 
presence of vertex a in every node of the path decomposition. 



G c b 




Fig. 2. A graph G having low pathwidth but high persistence. 



Persistence appears to be a natural and interesting parameter in both the online 
setting and the offline setting. For many problems where the input is generated 
as an ordered sequence of small units, it seems natural to expect that the sphere 
of influence of each unit of input should be localised. 

4 Online Coloring 

An online algorithm A for coloring an online graph G^ will determine the color 
of the ith vertex of G^ using only information about Gf. A colors the vertices of 
G^ one at a time in the order vi < V 2 ,- ■ and at the time a color is irrevocably 
assigned to Uj, the algorithm can only see Gf. 

A simple, but important, example of an online algorithm is First-Fit, which 
colors the vertices of G"^ with an initial sequence of the colors {1,2,...} by 
assigning to Vi the least color that has not already been assigned to any vertex 
in Gf that is adjacent to Vi. 
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Szegedy [18] has shown that, for any online coloring algorithm A and integer 
k, there is an online graph on at most fc(2^ — 1) vertices with chromatic num- 
ber k on which A will use 2^ — 1 colors. This yields a lower bound of J?( ) 

for the performance ratio of any online coloring algorithm on general graphs. 
Note that the worst possible performance ratio on general graphs is n. Lovasz, 
Saks, and Trotter [14] have given an algorithm that achieves a performance ratio 
on all graphs. 

Online coloring of some restricted classes of graphs has been considered. In 
the bipartite case it can be shown that, for any online coloring algorithm A and 
integer k, there is an online tree with 2*“^ vertices on which A will use at 
least t colors. Thus, we get a lower bound of Q{log n) for any online algorithm 
on bipartite graphs. Lovasz, Saks, and Trotter [14] give an algorithm that colors 
any bipartite online graph using at most 1-1-2 log n colors. 

Kierstead and Trotter [11] have given an online coloring algorithm that 
achieves a performance ratio of 3 on interval graphs, which is also best pos- 
sible. Kierstead [9] has shown that First-Fit has a constant performance ratio 
on the class of interval graphs. Gyarfas and Lehel [6] have shown that First-Fit 
achieves a constant performance ratio on split graphs, complements of bipartite 
graphs, and complements of chordal graphs. 

One approach that is similar in flavour to ours is presented by Irani [7]. 
Irani introduces the notion of d-inductive graphs. A graph G is d-inductive if 
the vertices of G can be ordered in such a way that each vertex is adjacent to 
at most d higher numbered vertices. Such an ordering on the vertices is called 
an inductive order. As for a path or tree decomposition, an inductive order is 
not necessarily unique for a graph. An inductive order of a graph G defines 
an inductive orientation of G, obtained by orienting the edges from the higher 
numbered vertices to the lower numbered vertices. Notice that, in an inductive 
orientation, the indegree of each vertex is bounded by d. Hence, any d-inductive 
graph is d -I- 1 colorable. 

In [7] it is shown that, if G is a d-inductive graph on n vertices, then First- 
Fit uses at most 0(d • log n) colors to color any online presentation G^ of G. 
Moreover, for any online coloring algorithm A, there exists a d-inductive online 
graph G"^ such that A uses at least Q{d ■ log n) colors to color G"^. 

A connection between graphs of bounded pathwidth or bounded treewidth, 
and inductive graphs, is given by the following lemma (see [13], or [15]). 

Lemma 1. Any graph G of pathwidth k, or treewidth k, is k-inductive. 



5 Online Coloring of Graphs with Bounded Pathwidth 

We consider two ways in which to formulate the problem of online coloring of 
graphs with bounded pathwidth. 

We can define a parameterized “presentation” problem, where we fix a bound 
k on the pathwidth of any input graph G, and then proceed to present G as an 
implicit path decomposition, in the manner described in Section 3 above. 
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Alternatively, we can define a parameterized “promise” problem, where we 
fix a bound k on the pathwidth of any input graph G, and then proceed to 
present G as a structure G^ = (V,E, <) where < is an arbitrary linear ordering 
off/. 

5.1 The Presentation Problem 

If we undertake to present a graph G in the form of an implicit path decomposi- 
tion, then we are effectively enforcing the presentation to be, if not best-possible, 
then at least “very good” for First-Fit acting on G. 

Lemma 2. If G is a graph of pathwidth k, presented in the form of an implicit 
path decomposition, then First-Fit will use at most k-\-l colors to color G^ ^ . 

This is easy to see, since, at each step i, 0 < i < n, the newly presented 
vertex Vi will be adjacent to at most k already-colored vertices. This result is 
best possible in the sense that the chromatic number (and, therefore, the online 
chromatic number) of the class of graphs of pathwidth k is fc -I- 1. However, 
note that G <p*^**' ^ niay not contain all of the information required to color G 
optimally online, as the following lemma shows. 

Lemma 3. For each k > 0, there is a tree T of pathwidth k presented as 
T< ^ on which First-Fit can be forced to use k -\- 1 colors. 

Proof. Suppose Tq is a connected tree with pathwidth 0, then T must consist of 
a single vertex (any graph of pathwidth 0 must consist only of isolated vertices) 
so First-Fit will color patho color. 

Suppose Ti is a connected tree with pathwidth 1 that has at least two vertices. 
Each vertex of ’’*‘*^*' ^ can be adjacent to at most one active vertex at the time 
of presentation. Since T\ is connected, there must be vertices that are adjacent 
to an active vertex at the time of presentation in any ^ . Thus, First-Fit 

will need to use two colors to color any p^**' ^ . 

Now, suppose that for any 0 < t < k, there is a tree Tt of pathwidth t, and a 
presentation Tf' p^*^*'*^ on which First-Fit can be forced to use t -\- 1 colors. We 
build a connected tree Tk with pathwidth k, and a presentation 
which First-Fit will be forced to use fc -|- 1 colors. 

We order the trees It, 0 < t < fc, and their presentations, in descending order 
Tfc_ 2 , . . . , To], and concatenate the presentations together in this order to 
obtain a new presentation Note that the subsequence P^tht will 

have at most (t -I- 1) < fc active vertices at any stage. 

To obtain p®'*^ we alter as follows. For each t, 0 < t < k, we choose 
the vertex vt from Tf' p^*^ * that is colored with color t -|- 1 by First-Fit and 
allow it to remain active throughout the rest of Every other vertex from 
j,< path t rendered inactive at the conclusion of Tf~ * in the concatenated 
presentation. Thus, at any stage of there will be at most fc -|- 1 active 

vertices, and at the conclusion of there will be fc active vertices, one from 
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each of the 0 < t < k. These k active vertices will be colored with 

colors 1, 2, . . . , fc respectively. We now present one new vertex, adjacent to each 
of the k active vertices, which must be colored with color fc + 1. □ 



k active vertices 
colored Ci . . . Cfe 




Fig. 3. Schema of ^ on which First-Fit can be forced to use A: -|- 1 colors. 



5.2 The Promise Problem 

Now, let us consider the “promise” problem for online coloring of graphs of 
bounded pathwidth. We first consider the case where G is a tree of pathwidth fc. 

Lemma 4. First-Fit will use at most 3fc -I- 1 colors to color any T"^ where T is 
a tree of pathwidth fc. 

Proof. Let fc = 0, then T consists only of an isolated vertex, and First-Fit 
requires only one color to color T^. 

Let fc be > 1. Suppose that the bound holds for fc — 1: for any tree T of 
pathwidth at most fc — 1 First-Fit colors any with at most 3k — 2 colors. 

We rely on the fact that any tree T of pathwidth fc consists of a path P and 
a collection of subtrees of pathwidth at most fc — 1, each connected to a single 
vertex on the path P (see [4]). 

Let Vi be a vertex appearing in one of the subtrees of T. When Vi is presented 
in T^, at time i, it will be colored by First-Fit using a color chosen from the 
first 3fc — 1 colors of {1,2, . . .}. 

Let r* be the subtree in which Vi appears. Let pi be the path vertex to which 
the subtree T* is connected. Let Vi = {vj \ j < i} and Tf' = T<[Fi], the online 
subgraph of induced by Vi . 

Suppose that pi is not present in Tf' . Then the component of Tf' containing 
Vi is a tree of pathwidth at most fc — 1, disjoint from all other components of 
T^. Thus, First-Fit will color Vi using a color chosen from the first 3k — 2 colors 
of {1,2,...}. 

Suppose that pi is present in Then, in Tf-, Vi will be adjacent to at 

most one vertex in the component of containing pi. Suppose that this is 
the case and that this vertex has been colored with some color Ci. Consider 
the other components of to which Vi may become connected. Together 
with Vi, these form a tree of pathwidth at most fc — 1. First-Fit will require 
at most 3k — 2 colors to color this tree, but Ci cannot be used to color Vi. If 
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Ci ^ {1,2,...,3A: — 2} then First-Fit will color Vi using a color chosen from 
{1, 2, . . . , 3A: — 2}. If Ci S {1, 2, . . . , 3A: — 2} then First-Fit will color Vi using a 
color chosen from {1, 2, . . . , 3fc — 1} — Ci. If, in , Vi is not connected to the 
component of containing pi, then First-Fit will color Vi using a color chosen 
from {1, 2, . . . , 3A: — 2}. 

Let Vi be a vertex appearing in the path of T. When Vi is presented in 
at time i, it will be colored by First-Fit using a color chosen from the first 3fc-|- 1 
colors of {1, 2, . . .}. 

Let Vi = {vj \ j < i} and = T<[Ci], the online subgraph of induced 
by Vi. 

In T^, Vi may be adjacent to single vertices from each of many subtrees. 
Note that, in each of the subtrees that becomes connected to Vi is disjoint 
from all other components of so any such subtree will have been colored 

only with colors from {1, 2, . . . , 3A: — 2}. 

The path vertex Vi can also be connected to (at most) two other path vertices 
already colored. If Vi is not connected to any other path vertex then Vi will be 
colored by First-Fit using a color chosen from the first 3/c — 1 colors of {1, 2, . . .}. 
If Vi is connected to only one other path vertex then Vi will be colored by First- 
Fit using a color chosen from the first 3/c colors of {1, 2, . . .}. If is connected to 
two other path vertices then Vi will be colored by First-Fit using a color chosen 
from the first 3fc -I- 1 colors of {1, 2, . . .}. □ 

Lemma 5. For each k > 0, there is an online tree T^, of pathwidth k, such 
that First-Fit will use 3/c -I- 1 colors to color T^. 

The proof given for Lemma 4 suggests a way in which to present a tree of 
pathwidth k that will require 3fc -|- 1 colors. 




3fc — 2 colors 



3k — 2 colors 



3k — 2 colors 



3k — 2 colors 



Fig. 4. Schema of online tree of pathwidth k on which First-Fit is forced to 
use 3fc -I- 1 colors. 



We now consider the “promise” problem for online coloring in the case of general 
graphs of pathwidth at most k. 

A graph G = (V, E) is an interval graph if there is a function which maps 
each vertex of V to an interval of the real line, such that for each u^v gV with 
u =/= V, tp{u) n ijj(v) yf 0 (u, ?;) € E. The function if is called an interval 
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realization for G. The relation between interval graphs and graphs of bounded 
pathwidth is captured by the following lemma (see [12], or [15]). 

Lemma 6. A graph G has pathwidth at most k if and only if G is a subgraph 
of an interval graph G' , where G' has maximum clique size at most k + 1. 

Kierstead and Trotter [11] have given an online algorithm that colors any 
online interval graph having maximum clique size at most fc + 1, using 
3/c + 1 colors. Thus, any graph of pathwidth k can be colored online using at 
most 3fc + 1 colors. Lemma 4, above, gives us a stronger result for trees - for 
trees of pathwidth k such a performance is possible using First-Fit. 

For general graphs of pathwidth at most fc, the situation for First-Fit is less 
well understood. First-Fit does have a constant performance ratio on graphs of 
pathwidth at most fc. Kierstead [9] has shown that for every online interval graph 
G^, with maximum clique size at most fc-|- 1, First-Fit will use at most 40(fc-|-l) 
colors. In [10] Kierstead and Qin have improved the constant here to 25.72. 

Chrobak and Slusarek [2] have used an induction argument to prove that 
there exists an online interval graph G'' , and a constant c, where c is the maxi- 
mum clique size of G^, such that First-Fit will require at least 4.4 • c colors to 
color G^ . Such an interval graph G^ will have pathwidth c — 1 and chromatic 
number c. Thus, the performance ratio of First-Fit on general graphs of path- 
width fc must be at least 4.4. However, experimental work by Fouhy [5] indicates 
that we can expect a performance ratio of 3 for First-Fit on randomly generated 
graphs of bounded pathwidth. 



6 Bounded Persistence Pathwidth 

In Section 3 we introduced a quite natural online presentation scheme that gives 
rise to graphs having bounded persistence pathwidth. We present the graph as a 
path decomposition, one node per timestep, where every vertex of the underlying 
graph belongs to at most I nodes of the path. The underlying idea is that a 
pathwidth 2 graph should look more like a “long 2-path” than a “fuzzy ball” . 

A related notion is domino treewidth introduced by Bodlaender and Engel- 
friet [1]. A domino tree decomposition is a tree decomposition in which every 
vertex of the underlying graph belongs to at most two nodes of the tree. Domino 
pathwidth is a special case of bounded persistence pathwidth, where 1 = 2. 

In the case of online coloring using First-Fit neither bounded persistence 
pathwidth, nor domino pathwidth, give us any real gain over our original path- 
width metric. However, the notion of persistence seems like it may be interesting, 
in the broader online setting, in its own right. 

In this section we present some basic results regarding the general recogni- 
tion of graphs having bounded persistence path decompositions. Note that, even 
though we will show that the recognition problem is hard, in many real life in- 
stances we may reasonably expect to “know” that the persistence is relatively 
low, and, indeed be given such a decomposition. 



Online Problems, Pathwidth, and Persistence 



23 



To demonstrate the hardness of basic problems associated with persistence, 
we will use the framework of parameterized complexity theory, introduced by 
Downey and Fellows [3] . We remind the reader that a parameterized language L 
is a subset of FI* x if* . If L is a parameterized language and (<t, k) G L then we 
refer to cr as the main part and k as the parameter. The basic notion of tractabil- 
ity is fixed parameter tractahility (FPT). Intuitively, we say that a parameterized 
problem is fixed-parameter tractable (FPT) if we can somehow confine the any 
“bad” complexity behaviour to some limited aspect of the problem, the pa- 
rameter. Formally, we say that a parameterized language, L, is fixed-parameter 
tractable if there is a computable function /, an algorithm A, and a constant c 
such that for all k, {x, k) G L iS A{x, k) = 1, and A{x, k) runs in time f{k)\xf (c 
is independent of k). For instance, /c-Vertex Cover is solvable in time 0(|a;|). 
On the other hand, for fc-TuRiNG Machine Acceptance, the problem of de- 
ciding if a nondeterministic Turing machine with arbitrarily large fanout has a 
A:-step accepting path, the only known algorithm is to try all possibilities, and 
this takes time J7(|a;|^). This situation, akin to A^P-completeness, is described by 
hardness classes, and reductions. A parameterized reduction, L to L' , is a trans- 
formation which takes {x,k) to {x',k'), running in time g{k)\x\‘^, with k k' a, 
function purely of k. 

Downey and Fellows [3] observed that these reductions gave rise to a hierarchy 
called the kF-hierarchy. 

FPT C IT[1] C kF[2] C . . . C W[t] C . . . . 

The core problem for W[l] is fc-TuRiNG Machine Acceptance, which is equiv- 
alent to the problem Weighted 3Sat. The input for Weighted 3Sat is a 
3CNF formula, ip and the problem is to determine whether or not ip has a satis- 
fying assignment of Hamming weight k. W[2] has the same core problem except 
that ip is in CNF form, with no bound on the clause size. In general, W[t] has as 
its core problem the weighted satisfiability problem for ip of the form “products 
of sums of products of ...” of depth t. It is conjectured that the VF-hierarchy is 
proper, and from kF[I] onwards, all parametrically intractable. 

In this spirit, we define the parameterized problem Bounded Persistence 
Pathwidth as follows: 

Instance: A graph G = {V, E). 

Parameter: A pair of positive integers (k, 1). 

Question: Is there a path decomposition of G of width at most k, 
and persistence at most 17 

Domino Pathwidth is a special case of this problem, where I = 2. 

The following results give strong evidence for the likely intractability of 
both the Bounded Persistence Pathwidth problem and the Domino Path- 
width problem. Note that these results mean that it is likely to be impossible 
to find FPT algorithms for either of these problems, unless an unlikely collapse 
occurs in the IF-hierarchy. The proofs of these results can be found in [15]. 
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Theorem 1. Bounded Persistence Pathwidth is W[t]-hard, for all t G Af. 
Theorem 2. Domino Pathwidth is W[2]-hard. 

7 Conclusions 

We consider the work presented in this paper to be a first step in a program to 
investigate the ramifications of online width metrics. Of course, there are many 
graph width metrics, along with many online problems, that are candidates 
for study. However, it does seem apparent that pathwidth, or metrics that are 
pathwidth-like, are a natural fit in this context. 
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Abstract. Consider the following problem, that we call “Chordless Path through 
Three Vertices” or Cp3v, for short: Given a simple undirected graph G = (V, E), 
a positive integer k, and three distinct vertices a, t, and n £ V, is there a chordless 
path from s via u to f in G that consists of at most k vertices? In a chordless path, 
no two vertices are connected by an edge that is not in the path. Alternatively, 
one could say that the subgraph induced by the vertex set of the path in G is 
the path itself. The problem has been raised in the context of service deployment 
in communication networks. We resolve the parametric complexity of Cp3v by 
proving it lL[l]-complete with respect to its natural parameter k. Our reduction 
extends to a number of related problems about chordless paths. In particular, de- 
ciding on the existence of a single directed chordless (s, f)-path in a digraph is 
also VL[l]-complete with respect to the length of the path. 

Keywords: graph theory, induced path, parameterized complexity. 



1 Introduction 

The number of specialized functions such as support for quality of service and protec- 
tion against denial-of-service attacks, that is being built into network nodes is growing 
continuously. Thus it is becoming increasingly difficult for network administrators to 
use such sophisticated capabilities fully, especially when new services must be deployed 
in a timely manner. The advent of reprogrammable network nodes, made possible by 
cost-efficient network processors, has aggravated this issue as new capabilities may be 
introduced into the network dynamically in order to provision a particular service. 

An automated method to perform service deployment was presented in [20], and 
several categories of services with similar deployment needs were introduced. This pa- 
per focuses on chordless paths: these are paths for which no two nodes are directly 
connected except along the path. Chordless paths are particularly relevant to the cate- 
gory of path-based services, i.e., services that require each node of a path from a source 
to a destination to be enabled with a common function. During the deployment of path- 
based services, nodes are queried for specific service requirements to determine whether 
they have the necessary capabilities to support a certain function. Such requirements are 
specific to each service, so that it is not recommended to let nodes advertise all their 
capabilities by default, which may even vary over time. Instead, on-demand query of 
relevant nodes is the preferred alternative. 
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Often it is necessary to query only a subset of the network nodes to determine 
whether a path with the required capabilities exists. If such a path contains a chord, 
there is a shorter path through the same set of nodes for this particular source/destination 
pair. Thus, for this type of query we are interested in only chordless paths. In particular, 
nodes that do not belong to any chordless path for a given source/destination pair are 
irrelevant and do not have to be queried. 

It has been shown that for typical Internet router-level topologies, a large fraction 
of nodes do not belong to chordless paths [20]. To avoid unnecessary queries, it would 
hence be desirable to decide the following problem efficiently. 

Problem 1 (Cp3v). Given an undirected graph G = (V, E), a positive integer k, and 
three distinct vertices s,t,v G V, is there a chordless path from s via v to t in G that 
consists of at most k vertices? 

But as we will prove in Section 4, this problem is Vk[l]-hard with respect to k\ that is — 
roughly speaking — it is unlikely that an algorithm exists whose complexity is bounded 
by an arbitrary (say, doubly exponential) function in k but polynomial in the size of the 
input graph. As another consequence [1, 8], there is probably no Ptas to compute an 
(1 + e) -approximation for the shortest chordless (s, v, f)-path in time bounded by an 
arbitrary function in £ but polynomial in the size of the input graph. 

After summarizing some related results, we complement the above-mentioned hard- 
ness claim in Section 3 by proving Cp3v to be in kk[l], that is, altogether Vk[l]- 
complete. Finally, Section 5 discusses how to extend our results for Cp3v to a number 
of problems concerning chordless paths in directed graphs. 

1.1 Related Work 

The following problem is a slight generalization of Cp3v. 

Problem 2 (Many Chordless (s, f)-Paths). Given a simple undirected graph G = 
(y, E), positive integers k and I, and two distinct vertices s,t G V, is there a set 
U C V of at most k vertices such that the subgraph induced by U in G is a disjoint 
union of £ chordless {s,t)-ptiths? 

This was shown to be NP-complete by Fellows [15], already for £ = 2, where it asks 
for a chordless cycle of length at most k through s and t. Let us refer to this problem 
as Cc2v. As any chordless cycle through s has to pass through one of the neighbors of 
s, an instance of Cc2v can be solved by less than \V\ calls to an algorithm that solves 
Cp3v. In particular, this implies NP-completeness of Cp3v. On the other hand, any 
Cp3v instance can be converted to a Cc2v instance in constant time: add a new vertex 
M to G and connect it to both s and f; now ask for a cycle through u and v of length at 
most k + 1. Hence, our VF[l]-completeness result for Cp3v also implies that Cc2v is 
FF[1] -complete. 

The hardness results for Cp3v and Cc2v rely on graphs that contain many vertex- 
disjoint (s, f)-paths. It is not difficult to see that in planar graphs the existence of four 
vertex-disjoint (s, f)-paths basically' implies the existence of a chordless (s, v, f)-path. 

' Except for some trivial cases which can be easily sorted out. 
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While this argument immediately gives an time algorithm for Cp3v (for some 

constant c G IN), the more interesting question is whether Problem 2 is polynomial 
for planar graphs. This was answered in the affirmative for every fixed I by McDi- 
armid et al. [23, 24]. 

If in Problem 2 we ask for vertex-disjoint paths only instead of requiring all paths 
to be jointly chordless, the problem is polynomial for general graphs and every fixed 
I, even for arbitrary source-target pairs 1 < i < £ [27]. But it remains Np- 

complete if £ is considered part of the input [21]. 

Deciding whether a graph contains a chordless path of length at least k is one of the 
classical NP-complete problems (Gt 23 in [19]). Bienstock [3, 4] listed several other 
NP-complete problems related to chordless paths: 

- Does a graph contain a chordless path of odd length between two specified vertices? 

- Does a graph contain a chordless cycle of odd length (> 3) through a specified 
vertex? 

- Does a graph contain a chordless path of odd length between any two vertices? 

Note that these results do not imply the hardness of deciding whether there exists any 
path/cycle of odd length in a graph. This question is still open, see the discussion below. 

Chordless cycles of length at least four are also called holes. They are tightly con- 
nected to Berge’s strong perfect graph conjecture [2], whose proof has recently been 
announced by Chudnovsky et al. [9]. According to this conjecture, a graph is perfect^ iff 
it is Berge, that is, if it contains neither an odd hole nor the complement of an odd hole. 
Hence, a polynomial time algorithm to decide whether there exists any odd hole in a 
given graph would immediately imply that perfect graphs can be recognized in polyno- 
mial time. Interestingly, no such an algorithm is known, although there are polynomial 
time algorithms [11, 10] to decide whether a graph is Berge, even independent of the 
strong perfect graph conjecture. Also, if the restriction to an odd number of vertices is 
omitted, the presence of holes can be detected in polynomial time: for holes on at least 
four vertices this is the well-studied recognition problem for chordal graphs [22, 28]. 
The problem of detecting holes on at least five vertices has recently been addressed by 
Nikolopoulos and Palios [26]. 

2 Notation 

For a graph G denote by V (G) the set of vertices in G, and denote by E(G) the set 
of edges in G. For a vertex v € V, denote by Na{v) the neighborhood of v in G, that 
is, the set of vertices from V that are adjacent to v in G. Similarly, for a set FF dV 
of vertices define Ng{W) := Uidgit ^g{w). The subscript is often omitted when it 
is clear which graph it refers to. A set / C 17 of vertices is an independent set in G if 
E does not contain edges between any two vertices of /. For a set IF C F of vertices 
denote by G\W] the induced subgraph of IF in G, that is, the graph (IF, E n (^)) or 
(IF, E n IF^) in the case of an undirected or a directed graph G, respectively. 

^ In a perfect graph, the maximum number of pairwise adjacent vertices (clique number) for 
each induced subgraph is equal to the minimum number of colors needed to color the vertices 
in such a way that any two adjacent vertices receive distinct colors (chromatic number). 
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A subgraph of G that has the form ({fi, . . . , Vk}, {{fi, fi+i} | 1 < * < ^}) is 
called path of length k in G. Note that by the set notation we imply that Vi vj, 
for 1 < i,j < k and that the length of a path is defined as its number of vertices. The 
vertices vi and Vk are referred to as the path’s endpoints, the other vertices are called 
interior. Two paths are called vertex-disjoint iff they do not share vertices except for 
possibly common endpoints. For two vertices s,t G V any path from s to f in G is 
called (s, f)-path. More generally, if vertices s, v, and t appear on path P in this order, 
we call P an (s, v, f)-path. A path P in G is called chordless if V (P) is an independent 
set in {V,E \ E{P)). An alternative equivalent definition would be to call a path P in 
G chordless iff G[V (P)] = P. Hence, such paths are also known as induced paths. 

Parameterized Complexity To cope with the apparent computational intractability of 
Np-hard problems, attempts were made to analyze more closely which parts or aspects 
of the input render a particular problem hard. A prototypical example is Vertex Cover, 
which asks for a set G of at most k vertices from a given graph G on n vertices such 
that for each edge at least one endpoint is in G. The trivial observation that for any 
edge at least one of the two incident vertices has to be in G, leads to an 0{2^n) time 
algorithm: choose an arbitrary edge and branch on the two possibilities, in both cases 
removing one vertex and all incident edges from the graph. Hence, the intractability of 
Vertex Cover is connected to the number k of vertices in the cover rather than to the 
size of the graph G. One says that Vertex Cover is fixed-parameter- tractable (Fpt) with 
respect to the parameter k because there is an algorithm that runs in 0{f{k)p{n)) for 
an arbitrary, typically exponential, function / and a polynomial function p. 

Naturally, there are also problems for which it is not known whether their com- 
plexity can be isolated into a particular parameter in this way. Moreover, similar to the 
classical complexity classes, there are classes of parameterized problems that are hard 
in the sense that if there is an Fp algorithm for any of them, then all of them are Fpt. 
The most important such class is called VF[1], which can be described in terms of the 
following “canonical” problem. 

Problem 3 (Weighted g-CNF-Satisflability). Given positive integers q and k, and a 
boolean formula F in conjunctive normal form such that each clause contains at most 
q literals, is there a satisfying assignment for F with at most k variables set to true? 

A problem P parameterized by k is said to be m-reducible to a problem P' parame- 
terized by k' iff there is a function h that maps an instance (x, k) of P to an instance 
(x', k') of P' such that fc' = g{k) and x' can be computed in time f{k)p{x), for arbi- 
trary functions / and g, and a polynomial function p. Now W[l] is defined as the class 
of parameterized problems that can be m-reduced to Weighted g-CNF-Satisfiability for 
some constant q. Finally, a problem is W[l]-hard iff every problem in W[l] can be m- 
reduced to it. A problem that is both VF[l]-hard and in W[l] is called W[l]-complete. 

At this point, we refer the interested reader to the literature for more in-depth in- 
formation about parameterized complexity. The book of Downey and Fellows [13] pro- 
vides a thorough treatment of complexity-theoretic aspects, whereas the survey of Nie- 
dermeier [25] focuses more on algorithms. 
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3 Membership in W[l] 

In this section we analyze the parameterized complexity of Cp3 V and prove the problem 
to be in W[l] with respect to its natural parameter, the path length. As a first step, note 
that a chordless (s, f)-path P is already determined by its set of vertices. For example, 
only one neighbor x of s can be in V (P) because any later visit of another neighbor 
would introduce a chord. Similarly, exactly one neighbor of x (other than s) can be in 
V{P). ln this manner P can be uniquely reconstructed from V{P). 

Proposition 4. A subgraph P of G is a chordless (s,t)-p‘^th if and only if P is con- 
nected, s and t have degree one in G\V (P)], and all vertices other than s and t have 
degree two in G[V (P)]. □ 

We do not know how to reduce Cp3 V to Weighted q-CNF-Satisfiability. Instead, we 
reduce to a different problem called SnTmc that is defined below. SnTmc is known to 
be VF[l]-complete [5, 14], and reduction to SnTmc and its relatives has proven to be a 
useful tool to establish membership results within the FF -hierarchy [6, 7]. 

Problem 5 (Short Nondeterministic Turing Machine Computation (SnTmc)). 

Given a single-tape, single-head nondeterministic Turing machine M, a word x on the 
alphabet of M, and a positive integer k, is there a computation of M on input x that 
reaches a final accepting state in at most k steps? 

Theorem 6. Cp3v is in W[l] w.r.t. k. 

Proof. Consider an instance {G, s,v,t, k) of Cp3v, where G = (V,E) is a simple 
undirected graph, s,v,t G V, and /c is a positive integer. We will construct an instance 
(M, k') of SnTmc such that there is a computation for M that reaches a final accepting 
state in at most k' = k^ -\-3k—l steps iff there exists a chordless (s, v, f)-path of length 
at most k in G. (It is important that E depends on k only and not on n.) A schematic 
view of the construction is shown in Fig. 1. 

Let M = (27, Q, A,gi, {A}), where the alphabet S is defined as S := {□} U 
{cr„ I M G y}; the state set is 

Q \= {A, R} U {gi\l <i <k} U {a,b,c,d,l,r} U {p„ \ u &V} U {qu\u & V} ] 

the transition relation A: QxSxQxSx {+, — , 0} is defined below; the initial 
state is (/i; the final accepting sfafe is A, and fhe final rejecting sfafe is R. When fhe 
Turing machine starts, all tape cells contain the blank symbol (□). The computation 
consists of three phases: first, the at most k vertices of a chordless (s, v, f)-path P in G 
are “guessed” by writing the sequence of corresponding symbols (t„, u G V{P), onto 
the tape. The next two phases are completely deterministic and check that P visits s, v, 
and t in the order given, and that P is a chordless path in G. 

First Phase: The Turing machine may write up to k arbitrary vertex symbols onto the 
tape: {gi, D,gi+i,au,+) G A, for all u G V and all 1 < z < fc, and {gi,D, a, cr„, 0) G 
A, for all u G V and all 1 < z < /c. (The transition specifies, in order, current state, 
symbol under the head, new state after transition, symbol to write to the tape, and move- 
ment of the head: + for right, — for left, and 0 for stay.) After the first phase, the Turing 
machine is in state a and the sequence of between one and k vertex symbols starts at 
the current tape cell, extending to the left. 
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Second Phase: Check whether the guessed sequence visits t, v, and s, in order. The 
rightmost symbol should heat', {a, at, b, at,—) G A. Then somewhere ay must appear: 
(b, ay,b, cr„, — ) G A, for all u G C \ {u}, and (6, cr„, c, ay,—) G A. The final symbol 
has to be as', (c, cr„, c, (t„, — ) G A, for all tt G V \ {s}, and (c, ag,d, as, — ) G A. 
Nothing may follow after s: (d, □,!,□,+) G A. For all state/symbol combinations 
that are not explicitly mentioned (for example, (6, □) or {a, ay)) there is a transition 
to the final rejecting state R. After the second phase, the machine is in state I and the 
head points towards the leftmost of the symbols that have been guessed in Phase 1 . The 
content of the tape remains unchanged during Phase 2. 

Third Phase: Scan and remove the first vertex: [I, cr„,p„, □, +) G A, for all u G V . 
If no more vertex is left at this point, we are done: (p„,D,A, 0,0) G A, for all 
u G V. Otherwise, the next vertex should be adjacent: (pu, cr^j, cr^j, +) G A, 
for all u,w G V for which {u,w} G E. Whatever follows must not be adjacent: 
(<Z«, o-y,,qu, ay,, +) G A, for all u,w G V with u ^ w and {u, w} ^ E. If all vertices 
have been checked, return to the leftmost: (q„, □, r, □, — ) G A, for all u G V, and 
(r, aw,r, ay,, — ) G A, for all w G V. Finally, re-iterate: (r, □, I, □, +) G A. Again, all 
state/symbol combinations that are not explicitly mentioned lead to the final rejecting 
state R. Note that after the third phase, all tape cells contain the blank symbol again. 

Phase 3 ensures that all vertices guessed in Phase 1 are distinct, as, otherwise the 
right scan in state g„, for some u G V, fails. Moreover, because of the transition from 
Pu to Qu, the vertices chosen form a path P in G. The right scan in state also ensures 
that no two of the vertices are connected except along P. Finally, in Phase 2 we check 
that the endpoints of P are s and t, and that P visits v. Altogether, the machine reaches 
an accepting state iff it guesses the vertices of a chordless (s, v, f)-path of length at most 
k in Phase 1 . An easy calculation reveals that if k symbols are written onto the tape in 
Phase 1, the remaining computation consists of exactly k^ + 2k — 1 transitions. □ 

4 Hardness for VF[1] 

In this section, we prove that Cp3v is FF[l]-hard using a reduction from Independent 
Set, which is one of the “classical” FF[l]-hard problems [12]. 

Problem 7 (Independent Set). Given a simple undirected graph G = (V, E) and a 
positive integer k, is there an independent set of size at least k in G? 

Consider an instance of Independent Set, that is, a graph G = {V, E) and an integer 
k, I < k < \V\, and let V = {fi, . . . , Vn}. We construct a graph G' from G such 
that the answer to the Cp 3 V problem on G' provides the solution to the independent- set 
problem on G. 

The main ingredient for our construction is called vertex choice diamond: it consists 
of n vertices v\, . . . , plus two extra vertices s* and P connected to each of the n 
vertices f], 1 < j < n, as shown in Fig. 2. Clearly, there are exactly n chordless 
(s*, f*)-paths in such a diamond. As the naming of the vertices suggests, we associate 
each of these paths with a vertex from G in a bijective manner: routing a path through 
Vj , for some 1 < j < n, is interpreted as selecting vj to be part of the independent set 
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Fig. 1. A schematic description of the Turing machine defined in the proof of Theo- 
rem 6. The transition arrows are labeled by. In order, symbol under head, symbol to 
write, and head movement. To increase readability the final rejecting state and all tran- 
sitions to it have been omitted. 

/ to be constructed. The construction uses k such vertex choice diamonds, which are 
connected by identifying and f®, for all 1 < i < A:. Let us call the graph described 
so far Gyc, where VC stands for vertex choice. 

Proposition 8. Any chordless (s^, t^)-path of length i in Gyc corresponds to a set of 
£ — k — 1 vertices in G. □ 






Fig. 2. A vertex choice diamond and three of its n chordless (s®, f*)-paths. 

The next step is to ensure that the vertex sets chosen by traversing Gyc on a chord- 
less path correspond to independent sets in the original graph G. To accomplish this, 
we construct G' from two symmetric copies of Gyc- Denote the vertices in the first 
copy C of Gyc by s®, Vj, and f®, whereas the vertices in the second copy F of Gyc are 
referred to as <t% , and t% for 1 < i < k and I < j < n. The graphs C and F are 
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connected by identifying and The construction of G" is completed by adding a 
number of edges that encode the adjacency of G. An example is shown in Fig. 3. 

- There is an edge in G' between and for all 1 < i < /c and all I < j,i <n 
with j ^ i. Such an edge is called consistency edge. 

- For every edge {vp,Vq} in G, connect the vertex sets {fp, and {t;^, for all 
1 <i,j < k with i ^ j, by a complete bipartite subgraph in G' . These edges are 
called independence edges. 

- The vertices v\ and ip\ are connected by an edge in G' to all of the vertices and 

for all 1 < i, j < k with i ^ j. These edges are called set edges. 




(c) G' (independence edges only). 



Fig. 3. An example illustrating the construction of G' for k = 2. Consistency edges 
are shown by solid lines, independence edges by dashed lines, and set edges by dotted 
lines. The vertex labels in G' indicate the correspondence to the vertices from G: for 
example, the vertices labeled “1” correspond to vi. The vertex is shaded dark. 



Lemma 9. No chordless (s^, a^)-path via in G' uses a consistency edge or an inde- 
pendence edge or a set edge. 

Proof. Let P be a chordless (s^, cr^)-path via in G' . Consider the initial part of P 
which traverses (part of) the first vertex choice diamond of C. By construction of G' , 
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exactly one of the vertices wj, 1 < j < n is on P. (All of these vertices are neighbors 
of s^.) Similarly, the final part of P contains exactly one of the vertices ip}, 1 < £ < n. 
Moreover, if £ ^ j vertices vj and pj are connected by an edge in G' . Hence, they 
together with and induce an (s^, (j^)-path in G' that does not visit v and cannot 
be extended to a chordless path visiting v either. Thus, we may conclude that £ = j. 

Furthermore, note that by construction wj and pj have the same neighbors along 
both independence and set edges. Thus, if P continues along an independence edge or 
set edge from either vertex, it has to do so from the other vertex as well, as otherwise 
the unused edge would form a chord, but, as above, P cannot reach in this case. 

In summary, the initial part of P goes from via uj, for some 1 < j < n, to 
= s^, and the final par! of P is completely symmetric: from via pj to By 
induction on k, the initial part of P is an (s^, f^)-path Q that visits all s* in increasing 
order, for 1 < i < n, without using any consistency, independence, or set edge, and the 
final part of P is a (t^, (T^)-path that is completely symmetric to Q. □ 

Theorem 10. Cp3v is W[l]-hard w.r.t. k. 

Proof. Given an instance (G, k) of Independent Set, we construct the graph G' as de- 
scribed above. The graph G' contains 2/c(n + 1) + 1 vertices. To compute the number 
of edges in G' note that there are Akn edges in the 2k vertex choice diamonds, plus 
kn{n — 1) consistency edges, Amk{k — 1) independence edges, and 2nk{k — 1) set 
edges, where n := \V (G)| and m := |P(G)|. Hence, G' can be constructed from G in 
time and space polynomial in both n and k. 

Let P be a chordless (s^, (T^)-path via of length at most 4fc + 1 in G'. By 
Lemma 9, P has a very special form: in particular, its length is exactly 4fc + 1, and 
it visits exactly one vertex from each of the vertex choice diamonds with 1 < i < n. 
Let / := {vj^ G V (G) | Vj. € V (P) for 1 < i < n}. Suppose that u*. = for some 
1 < i,£ < k with i £. As Vj. and are connected by a set edge in G' that is not in P 
by Lemma 9, this set edge forms a chord of P, in contradiction to our assumption that 
P is chordless. Therefore, the vertices vf visited by P correspond to mutually distinct 
vertices in G, that is, together with Proposition 8 it follows \I\ = 2k + I — k — 1 = k. 

Furthermore, we claim that I is an independent set in G. Suppose that for two 
vertices and on P, 1 < z, ^ < A: and i £, the corresponding vertices and Vj^ 
are neighbors in G. Then by construction . and are connected by an independence 
edge in G'. Again, this edge is not in P by Lemma 9, that is, it forms a chord of P, in 
contradiction to our assumption that P is chordless. Therefore, no two vertices in I are 
adjacent in G. 

Conversely, it is easy to see that for any independent set I = {ui, f 2 , • ■ • , Vk, ■ ■ •} 
(without loss of generality) of size at least A: in G there is a chordless (s^, (T^)-path P 
via of length 4A; + 1 in G': in the z-th vertex choice diamonds, P visits vl and p\, for 
l<i<k. 

Therefore, we have a parameterized reduction from an independent set instance 
(G, k) to a Cp3v instance (G', 4A: + 1), establishing VF[l]-hardness of Cp3v. □ 
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5 Chordless Paths in Digraphs 

The notion of chordless paths generalizes in a straightforward manner to digraphs. 

Problem 11 (Directed Chordless (s,f)-Path (Dcp)). Given a simple digraph G = 
(y, E), a positive integer k, and two distinct vertices s,t G V, is there a chordless 
directed (s, t)-path of length at most k in G? 

Fellows et al. [16] showed that DCP is NP-complete even if restricted to planar digraphs. 
Our constructions described above can easily be adapted to the directed setting. 

Theorem 12. Dcp is W[l]-complete w.r.t. k. 

Proof. In the Turing machine of Theorem 6 replace all conditions that require the exis- 
tence of an edge by corresponding conditions requiring the presence of a directed edge. 
Similarly, all conditions requiring the absence of an edge are replaced by corresponding 
conditions disallowing both directed edges. 

The construction described in Theorem 10 is modified as follows. In the vertex 
choice diamonds direct all edges from s* to u* and from u* to P, for all 1 < i < A: and 
all 1 < J < In the symmetric copy, direct all edges from r* to and from to 
a\ for all 1 < i < k and all 1 < j < n. These orientations induce a linear ordering 
(s^, . . . , = r^, . . . , CT^) on y {G'). The remaining edges, that is, the consistency, 

independence, and set edges all are oriented from the vertex that is greater with respect 
to this linear order to the smaller vertex. It is easy to verify that no chordless directed 
(s^, (T^)-path can use a consistency, independence, or set edge. In fact, the orientation 
is chosen such that any chordless directed (s^, (T^)-path in G' passes through = r^, 
although this is not required by definition, in contrast to Cp3v. □ 

As a consequence, also the following problem is iy[l] -complete w.r.t. k. (Just add a 
single directed edge (<t^, s^) to the construction described in Theorem 12.) 

Problem 13 (Directed Chordless Cycle). Given a simple digraph G = (y, E), a pos- 
itive integer k, and a vertex s G y, is there a chordless directed cycle of length at most 
k through s in G? 

Note that both problems are polynomial if the path or cycle is not required to be chord- 
less: the maximum number k of vertex-disjoint directed (s, f)-paths can be computed in 
0{k\E\) time using flow techniques [17]. However, deciding whether there exist a di- 
rected (si, fi)-path and a directed (s 2 , f 2 )-path that are vertex-disjoint is NP-complete, 
even for ti = S 2 and t 2 = si [18]. 

Also, if the definition of chordless is relaxed to allow “back-cutting” arcs within 
each path, DCP restricted to planar graphs is polynomial, even for an arbitrary but fixed 
number of chordless (s,f)-paths [24]. The existence of such arcs is the crucial differ- 
ence between the directed and the undirected problem: in an undirected (s, f)-path P 
every edge joining two vertices that are non-adjacent along P can be used as a shortcut. 
That is, the presence of any (s, f)-path implies the existence of a chordless (s, f)-path. 
However, we will show below that admitting back-cutting arcs does not change the 
parametric complexity of the problem for general graphs. 
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Definition 14. An (s, t)-path P in a graph G = {V, E) is called weakly chordless ijf 
P is a shortest (s, t)-path in G\V (P)]. 

Observe that there is no difference between chordless and weakly chordless in undi- 
rected graphs. But, in contrast to DCP, the presence of a directed weakly chordless 
(s, f)-path can be decided in linear time by a breadth-first search. However, the gener- 
alization to several paths defined below is again TP [1] -complete, already for two paths. 

Problem 15 (Many Weakly Chordless (s, t)-Paths). Given a simple digraph G = 
(V, E), positive integers k and t, and two distinct vertices s,t € V, is there a set U GV 
with \U\ < k such that G\U] is a disjoint union of£ weakly chordless (s, t)-paths? 

Theorem 16. Two Weakly Chordless (s, t)-Paths is W[l]-complete w.r.t. k. 

Proof. It is clear how to adapt the Turing machine construction of Theorem 6 to estab- 
lish membership in W[l]. 

The construction described in Theorem 10 is modified as follows. First, add a di- 
rected edge from to s^, and let s := and t := . In the vertex choice 

diamonds direct all edges from P to Vj and from Vj to s*, for all 1 < i < fc and all 
£ < j < n. Likewise, in the symmetric copy direct all edges from r* to and from 
(p* to cr*, for all 1 < i < k and all 1 < j < n. Remove all independence and set edges 
within the same diamond chain, such that all remaining independence or set edges are 
between u* and ipP, for some 1 < i,p < k and 1 < j,q < n. Direct those edges from 
Vj towards ipP. 

Consider s^)-paths P and Q in G' such that G[V (P)UV{Q)] is a disjoint union 
of weakly chordless s^)-paths. The way the edges are directed, one of the paths, say, 
P comes via the vertex choice diamonds and visits the vertices , s^, s^, 

in order. On the other hand, Q traverses the symmetric copy and visits the vertices 
, . . .r^,(T^,in order, before finally reaching via the added edge. Because 
there must not be any edge between P and Q, we can argue as in Theorem 10 that the 
vertices u* and ipP visited by P and Q, respectively, correspond to an independent set 
of size at most fc in G. □ 
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Abstract. Let G = (V, E) be a graph. The linear-width of G is defined 
as the smallest integer k such that E can be arranged in a linear ordering 
(ei, . . . , 6r) such that for every i = 1, . . . , r— 1, there are at most k vertices 
both incident to an edge that belongs to {ei,...,ei} and to an edge 
that belongs to {ci+i, . . . ,6r}. For each fixed constant k, a linear time 
algorithm is given, that decides for any graph G — (U, E) whether the 
linear-width of G is at most k, and if so, finds the corresponding ordering 
of E. Linear-width has been proved to be related with the following graph 
searching parameters: mixed search number, node search number, and 
edge search number. A consequence of this is that we obtain for fixed k, 
linear time algorithms that check whether a given graph can be mixed, 
node, or edge searched with at most k searchers, and if so, output the 
corresponding search strategies. 

Keywords: linear-width, pathwidth, treewidth, characteristics, graph searching. 

1 Introduction 

In this paper, we study algorithmic aspects of the graph parameter of linear- 
width. Apart from having interest on its own, this parameter has close relation- 
ships with some well known graph searching parameters: mixed search number, 
node search number, and edge search number, as well as well known parameters 
related or equal to these parameters, like pathwidth, vertex separation number, 
and proper pathwidth. The results for linear- width allow us to rederive some 
old results for some of these parameters, and obtain similar new results for the 
other parameters, basically all as consequences of one algorithm. 

The linear-width of a graph G is defined to be the least integer k such that 
the edges of G can be arranged in a linear ordering (ei, . . . , e^) in such a way 
that for every i = l,...,r — 1, there are at most k vertices that incident to 
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at least one edge that belongs to {ei, . . . , e,} and that are incident to at least 
one edge that belongs to {ci+i, . . . , e^}. Linear- width was first mentioned by 
Thomas in [28] and is strongly connected with the notion of crusades introduced 
by Bienstock and Seymour in [2]. Linear- width can be seen as “a linear variant 
of branch- width” , in the same way as pathwidth can be seen as “a linear variant 
of treewidth” . In [25] , it is proved that several variants of problems appearing 
on graph searching can be reduced to the problem of computing linear-width. 

In a graph searching game a graph represents a system of tunnels where an 
agile, fast, and invisible fugitive is resorting. We desire to capture this fugitive 
by applying a search strategy while using the fewest possible searchers. Briefly 
said, the search number of a graph is the minimum number of searchers a search- 
ing strategy requires in order to capture the fugitive. Several variations on the 
way the fugitive can be captured during a search, define the the parameters of 
the edge, node, and mixed search number of a graph (namely, es(G), ns(G), and 
ms(G)). The first graph searching game was introduced by Breisch [7] and Par- 
sons [18] and is the one of edge searching. Node searching appeared as a variant 
of edge searching and was introduced by Kirousis and Papadimitriou in [14]. 
Finally, mixed searching was introduced in [24] and [2] and is a natural generali- 
sation of the two previous variants (for the formal definitions see Section 4 - for 
other results concerning search games on graphs see [1,8,9,15,16,23].) 

The problems of computing es(G), ns(G), ms(G), or linear-width(G) is NP- 
complete (see [16,14,24,25]). On the other hand, since all of these parameters 
are closed under taking of minors, we know (see e.g. [3,20,19,22,21]) that, for 
any k, there exists a linear algorithm that given a graph G checks whether 
es(G), ns(G), ms(G), or linear- width (G) is at most k. In other words, all these 
parameters are fixed parameter tractable by a linear parameterized algorithm 
(i.e. an algorithm of time complexity 0{f{k)n) where / is a function not de- 
pending on n). Unfortunately, the above result is not constructive, i.e. does not 
provide a way to construct the corresponding algorithm (see [10,11]). There- 
fore, it is highly desired to have constructive “fixed parameter results” for the 
aforementioned parameters. 

In this paper we carry out the above task by constructing such a linear pa- 
rameterized algorithm for linear-width. This algorithm can be directly transfered 
to a linear parameterized algorithm for node, edge, and mixed search number 
thanks to their connection (see [25]) with linear-width. 

So far, such a linear time algorithm has been constructed (see [3,4]) only for 
the parameters of treewidth and pathwidth (actually, the result in [3,4] can be 
directly transfered to the node search number which is known to be equal to the 
pathwidth plus one - see [12,13,17]). To be precise, [3,4] state that for fixed k, 
one can determine in linear time whether a given graph has pathwidth at most k, 
and if so. And a path decomposition of minimum width. This algorithm first finds 
a tree decomposition of width at most k, if it exists (if not, then the pathwidth is 
also larger than k), and then uses this tree decomposition to solve the problem, 
using the result in [4] that states that for fixed k and I, one can test whether 
the pathwidth of a graph G is at most I, and if so. And a minimum width path 
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decomposition, assuming that G is given together with a tree decomposition 
of width at most k. This paper uses a similar idea: we first determine a path 
decomposition of bounded width (if such a path decomposition does not exist, 
we know that the linear- width is also not bounded), and then apply the main 
algorithm, presented in this paper, that, given such a path decomposition, solves 
our problem. It can actually be avoided to work with tree decompositions by a 
modification of the algorithm in [3]. The main algorithm of this paper further 
develops the main technique of [4] that makes use of a special type of data 
structures called “sets of characteristics” . In general, such a structure filters the 
essential information on the existence of a bounded width layout of a the graph 
with respect to some of the nodes of the given path-decomposition. 

We stress that the general technique of defining “sets of characteristics” has 
a long history of development and applications. Apart from treewidth and path- 
width, it has been used on the design of linear time parameterized algorithms for 
vertex layout parameters like cutwidth and carving- width [26] . Moreover, it was 
also used in [27] for a polynomial algorithm for the pathwidth of graphs with 
bounded treewidth and maximum degree. Moreover, its kernel ideas have been 
generalized so that a systematic derivation of linear parameterized algorithms 
is possible for a general class of graph parameters including directed cutwidth, 
modified cutwidth, directed modified cutwidth, and directed pathwidth. How- 
ever, none of the previous results was able to cover the case of linear- width and 
the graph search parameters that we examine in this paper. Intuitively, the rea- 
son is that these parameters are defined using edge orderings instead of vertex 
orderings. Indeed it appears that the definition of a characteristic for edge or- 
derings cannot be derived by a straightforward generalization of the previous 
results. In this paper we define a special (and non-trivial) type of “characteris- 
tics” for the case of linear-width which. This makes it possible to derive, in a 
uniform way, a linear time parameterized algorithm for the three classic search 
parameters. 

An other parameter related to linear-width is branch-width. In another pa- 
per [5], we give a similar algorithm for branch- width. That algorithm uses the 
techniques of this paper as a building block for a more complicated algorithm. 

This paper is organized as follows. In Section 2, we give some preliminary 
results and definitions. The main algorithm is presented in Section 3. The con- 
sequences of the main algorithm can be found in Section 4. Due to space restric- 
tions, all the proofs are omitted in this extended abstract (see [6] for details). 

2 Definitions and Preliminary Results 

We first give a number of definitions and notations, dealing with sequences (i.e., 
linearly ordered sets) ut = (wi, . . . , iVr) of objects from some set O. In this paper 
O can be a set of numbers, sequences of numbers, vertices, vertex sets, or edges. 
All sequences will be indexed starting by 1 except from the case were we have 
either sequences of vertices or sequences of sequences of numbers that will be in- 
dexed starting by 0 (we adopt this convention because it will facilitate the presen- 
tation of our results). Given a sequence uj and two of its indices i,j where i < j, 
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we use the notation to denote the subsequence {oji, . . . ,ujj) ofoj. Also, if “o” 
is an operation defined on O, we use for any to' G O, the notation (w)ijou;', i < j, 
to denote the sequence {ujiouj', . . . ,u>j ou>'). Given two sequences w*, z = 1, 2, de- 
fined on O, where w* = (w*, . . . , * = 1, 2 we set = (WgOWg, . . . , 

Also, given two sequences Ui = (w* . , . . . , w* J , z = 1,2 (not necessarily of the same 
length) we define their concatenation 0 , . . . , , . . . , (we 

stress that in the last two definitions, s = 1 except from the cases where we 
consider either sequences of sets or sequences of sequences of numbers where 
s = 0). We finally denote the length of a sequence w by |w|. 

Unless mentioned otherwise, we will assume all graphs considered in this 
paper to be undirected and without parallel edges or self-loops. Given a graph 
G = (V, E) we denote its vertex set and edge set with V (G) and E{G) re- 
spectively. If V' C V{G), we call the graph {V',{{v,u} G E{G) : v,u G V'}) 
the subgraph of G induced by V and we denote it by G[U']. For any edge set 
E C A(G) we denote by V{E) the set of vertices that are incident to edges of 
E (i.e. V{E) = Uee_Ee). The degree of a vertex v in graph G is the number of 
edges containing it and is denoted by da{v). We call a vertex pendant when it 
has degree 1. We call an edge of a graph pendant when it contains a pendant 
vertex. 

Pathwidth: A path decomposition of a graph G = (U, e) is a sequence X = 
(Ao, . . . ,X\x\) of subsets of V{G) such that (i) V = Uo<i<|JC| (^0 for any 
{v, zc} G e, there is an z, 1 < z < |A|, where {v, zc} C Aj, and (iii) for all z, z', z", 
0 < z < z' < z", Ai n Xiii C Xii. The sets Aj are called the nodes of the path 
decomposition. The width of a path decomposition A=(Ao,l<z<|A|) equals 
maxo<i<|x|{|Aii| — 1}. The pathwidth of a graph G is the minimum width over all 
path decompositions of G. If A = (A^, 0 < z < |A|) is a path decomposition of a 
graph G, we say that A is nice if |Ao| = I and Vi_i<i<|x| (Aj — Aj_i) U (Aj_i — 
Aj)| = I. It is easy to see that for any constant k, given a path decomposition 
of a graph G that has width at most k and 0(|U (G)|) nodes, one can find a nice 
path decomposition of G that has width at most k and at most 2|U(G)| nodes 
in 0(|U (G)|) time. 

Let Aj be a node of a nice path decomposition A such that z > I. We say 
that Aj is an introduce {forget) node if |Aj — Aj_i| = I (|Aj_i — Aj| = I). It is 
easy to observe that any node Aj,z > I of a nice path decomposition is either 
an introduce or a forget node. 

Linear-width: The linear-width of a graph is defined as follows. Let G be a 
graph and I = (ci, . . . , e^) an ordering of E{G). For z = 1, . . . , r — 1, we define 
5i{ei) = V{{l)i,i) n V{{l)i+ix) (i.e. <5;(ej) is the set of vertices in V{G) that are 
endpoints of an edge in (?)iy and also of an edge in (0i-i-i,|£;(G)|)- The linear- 
width of an ordering I is maxi<j<r{|<5i(ej)|}. The linear-width of a graph is the 
minimum linear-width over all the orderings of E{G). It is not hard to prove 
that for every graph G, pathwidth(G) < linear- width (G) (see [6]). 

Given an edge ordering I we define P{1) = (0, (5/(ei), . . . , <5;(er-i), 0). More- 
over, we define Q{1) = ((|P(0)|), . . . , (|F’(r)|)) Notice that P{1) is a sequence of 
sets and Q(^) is a sequence of sequences of numbers each containing only one 
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element. We use this, somewhat overloaded, definition for reasons of consistency 
with the terminology of a characteristic that will be introduced later. We finally 
define the pendant sequence of I as H{1) = (0, Ci n A{G), ...,6^0 A{G)) where 
A{G) = {u G V{G) : daiv) = 1} (i.e. for 1 < z < r the element of H{1) indexed 
by z is the set of pendant endpoints of Cj). 

Given a vertex set V, a vertex x € V, and a sequence of vertex sets I = 
(Iq, . . . , Ir) where li C V,z = 0...,r and x G Uj=o,...,rdi, we define Fj{x) = 
min{m, 0<m<r\xG Im} and Lj{x) = max{m, 0<m<r\xG Im}- 

3 A Decision Algorithm for Linear-Width 

In this section, we give for every pair of integer constants k, I, an algorithm that, 
given a graph G with a path decomposition of width at most I, decides whether 
G has linear- width at most k. 

Sequence of integers: If ^ = (ai, . . . , a^) is a sequence of integers, we define 
max(^) = maxi<i<r{max(ai)} and for any integer t we set A + t = (ai -I- 
t, . . . , Qr+t). The typical sequence t{A) of a sequence of integers A is the sequence 
obtained after iterating the following operations, until none is possible any more. 

(i) If for some z,I < z < r — 1 ai = Oj+i, then set A = (ai, . . . , Ur) <— 

(g. 1 , . . . , Uj , ai-j-2 , ■ . . , a^)- 

(ii) If the sequence contains two elements oz and aj such that z-|-2 < j and either 

^ ^ or '^k,i<^k<.j ^ ^ j then set A — (ui, . . . ,ZZr) ^ 

(g. 1 , . . . , (Zj , Uj , . . . , (Zj.) . 

As an example, if A = (5, 7, 6, 7, 7, 7, 3, 4, 3, 5, 4, 3, 6, 8, 2, 9, 3, 4, 9, 6, 7, 2, 7, 5, 
4, 4, 4, 7, 4, 6, 4), then r(A) = (5, 7, 3, 8, 2, 9, 2, 7, 4). 

We define the set E{A) of the extensions of A as the set containing any set A* 
such that A can be obtained by A* after applying operation (i) of the definition 
of a typical sequence until this is not possible any more. More formally: 

^(A) = {A* = (a*, . . . , a^.) | = “d- 

Let A, B be two typical sequences where A = (ai, . . . , Ur) and B = (bi, . . . ,br) 
(notice that |A| = \B\ = r). Then we say that A < i? if Vi_i<i<|^l Ui < bi. We 
also say that A ^ B ii there exist extensions A* G E(A), B* G E{B) such that 
|A* I = |i3*| and A* < B* . For example if A = (5, 7, 4, 8) and B = (1, 7, 2, 6, 4) 
then B ^ A because A* = (5, 7, 7, 7, 4, 8, 8, 8, 8) is an extension of A, B* = 
(1, 7, 2, 6, 4, 4, 4, 4, 4) is an extension of A, and B* < A*. 

Suppose now that A = (Aq, . . . , A^) and B = {Bq, ■ ■ ■ , Br) are two equal 
length sequences of sequences. We say that A ^ B if Vi,o<z<r Ai ^ Bi. Fi- 
nally, for any integer t we set A -|- t = (Aq + t,. . . , Ar + t) and max(A) = 
maxo<j<r{max(Ad}. 

Characteristics of a typical triple: Let A be a path decomposition of a 
graph G and let Xi be a node in X where 0 < z < |A|. Let S = {Sq, . . . , Sr) be a 
sequence of vertex sets where Sj C Aj, 0 < j < r, D = {Dq, . . . , Dr) a sequence 
of subsets of Aj where \Dj\ < 2,0 < j < r and yj,o<j<i<\D\Ei n Dj = 0, 
and T = (Tq, . . . , T^) be a sequence of typical sequences. We call such a triple 
(S', D, T) a typical triple of Aj. 
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Given a typical triple {S, D, T), of Xi we define its characteristic C{S, D, T) 
as a typical triple (/, K, A) defined as follows. 

Let (rg , . . . , Tg+i) be the sequence with 0 = ro < • • • < = r + 1 where 

(a) ^ j,o<j<q h,rj<h<Tjj^i {Sh — Sj-j and Dfi = 

(b) '^j,0<j<q (5'rj + i 7^ Srj Or L)rj+i 7 ^ ^rj)- 

We call the pair {I,K) where / = {Sro, ■ ■ ■ , Sr^) and K = {Dg, . . . , Dr^) the 
interval model of S and D. Notice that, as H Dj = 0, we can 

observe that if yf 0 then r^+i = rj + 1 and Vj-i = Vj — 1 . 

For any j, 0 < j < q, let Aj = {Tr^ 0 ••• 0 We also set A = 

(Aq, . . . , Aq) ^ (r(Ao), . . . , T{Aq)) and we call the typical triple (/, K, A) the 
characteristic of the typical triple (S', D, T). Given two typical triples (S*, D*, T*), 
j = 1,2 we say that (S^,H^,T^) = (S^,,I?^,T^) if they have the same char- 
acteristic i.e. C(S^,H^,T^) = C{S^,D‘^,T^). We also say that (S^,H^,T^) ^ 
(S^,H^,T^) if S^ = S^, (i.e. they have the same interval model), 

and T^. It is easy to see that relation “ 0 ” is transitive i.e. if 

and then T^. We also extend the definition of “ 0 ” so 

that whenever (S*, H*, T*), z =1,2 are two typical triples, the typical triple 
(Si 0 S 2 , L)i 0 £> 2 , Ti 0 T 2 ) is denoted by (S\D\T^) 0 {S^,D‘^,T^). 
Characteristic of an ordering: Let A be a path decomposition of a graph 
G and Xi some node of X where 0 < z < \X\. For z = 0, . . . , |A|, we define 
V^ = Ug<j<.X, and Gi = G[V^. 

Let I be an edge ordering of Gi with linear- width at most k. Let also P{1) = 
{Pg , . . . , Pr). We define the restriction of I on Xi as R{1) = {PgllXi , . . . , PrHXi). 

We call the triple C{R{1), H{1), Q{1)) = (/, K, A) characteristic G{1) of 1. We 
say that the pair (/, K) is an interval model at Xi and A is the correspond- 
ing sequence of typical sequences. From now on, whenever we consider an edge 
ordering we will associate it with some node Xi of a path decomposition. 

Using Lemmata 3.5 and 3.5 of [4] we can prove that the number of different 
characteristics of all possible edge orderings of Gi with linear-width at most fc, 
is bounded by a function of k and I (i.e. is independent of n). 

The following procedure defines function Com, that maps a typical triple 
{I, K, A) to another typical triple that is a “compression” of the input triple. 



Procedure Com(/, A, A) 

Input: A typical triple (/, K, A) 

Output: A typical triple (I, K, A). 

1: Set r = |/| = \K\ = |A|. 

2: Apply the following operation until it is no longer possible. 

• If 3g<h<r-i ■ {h = Ih+i and Kh+i = 0) then set 

I — {Iq j ■ ■ ■ 1 Ir—l) {Iq j ■ ■ ■ t Ihi Ih+ 2 i ■ ■ ■ 1 Ir) j 

K={Kg,..., Kr-l) ^ (Ao, . . . , Kh, Kh+ 2 , . . . , Kr), 

A = {Ag, . . . , Ar-l) ^ {Ag, . . . , Ah-l,T{Ah 0 
Ah+i),Ah+2,---,Ar), and 
r <— r — 1. 

3: end. 
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A set FS{i) of characteristics of edge orderings of a graph Gi {i is a node of 
the path decomposition) with width at most k is called a, full set of characteristics 
at i if for each linear ordering I of Gi with linear- width at most k, there is a edge 
ordering I' such that G{1') ^ C{1) and C{1') G FS{i), i.e. the characteristic of I' 
is in FS{i). The following lemma can be derived directly from the definitions. 

Lemma 1. A full set of characteristics at i is non-empty if and only if the 
linear-width of Gi is at most k. If some full set of characteristics at i is non- 
empty, then every full set of characteristics at this node is non-empty. 

An important consequence of Lemma 1 is that the linear-width of G is at 
most k, if and only if any full set of characteristics of G\x\ = G is non-empty. 
Clearly, in the cased z = 0, Gi is edgeless and thus Fq = {((),(),())}. In what 
follows, we will show how to compute a full set of characteristics at a node Xi in 
0(1) time, when a full set of characteristics of Ai_i is given. This will depend 
on the type of the node Xi and will be explained in the following sections. 
Introducing an edge: The procedure int is an important ingredient of our 
algorithm. Given a typical triple (/, AT, A), it “inserts an edge”: after the mth 
position in the jth sequence in A, an edge is inserted; S is the set of the vertices 
that are endpoints of the edge and W consists of the endpoints of S that are 
isolated before the insertion (and therefore become pendant after it). Both I, K, 
and A are modified accordingly after the insertion. 



Procedure \r\t{I,K,A,j,m,S,W) 

Input A typical triple (/, AT, A) where / = {Iq, ■ . ■ , Iq), AT = {Kq, ■ ■ ■ , Kg), 
A = (Aq, . . . , Aq) where Ai = (a\, . . . , a* J, I <i < q, two integers j, m where 
0 < j < (?, 1 < ™ < ft) and two vertex sets S,W G W where S' — IT C 
Uo<i<g^WU A:(f). 

Output A typical triple {I, K, A). 

1: {Splitting step) Apply the following three steps. 

• Set / = (/g, . . . , Iq+i) < {Iqi ■ ■ ■ ^ Ij-li Ij^ ^i+l) ■ • ■ ) ^q)- 

• Set K' = (A' , . . . , a;+i) ^ (Ag - S, . . . , A,_i - S, A, - S, IT, A,+i - 
S,...,Kq-S). 

• Set A' = ^ 

1 ■ • ■ 1 -^r) ■ 

2: {Insertion step) For any x € S — IT, apply the following. 

• For any /i, 0 <h< g-l-1, min{L//uic'(a;), j-|-l} <h< maxjj, A//uif'(a:) — 1} : 

set {a;} and ^ A^ -b 1 

3: Output A', A'. 

4: end. 



The following lemmata provide important information on procedure int. The 
proofs is quite technical (see [6] for the details). 

Lemma 2. Lett be an ordering and assume that {I, K, A) = C{R{1), H{l),Q{l)). 
Then for any vertex sets S, IT where IT C S, the following hold. 
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(i) For any j, to, 0 < j < q = |/|,1 < m < qj = \Aj\, there exists an integer 
7 , 0 < 7 < ^ = 1^1 such that 

Com(lnt(/, A, j, TO, S', VH)) = Com(lnt(i?(l), i7(l), Q(l), 7 , 1,S, VH)). 

(ii) For any 7 , 0 < 7 < r = |?|, there exist two integers j, to, 0 < j < g = | J|, 1 < 
m < qj = \Aj\ such that 

Com(lnt(/, A, A, j, TO, S, VH)) Com(lnt(i?(l), iS(^), Q(^), 7 , 1,S, VH)). 

Lemma 3. Let (/*,A*,A*),i = 1,2 he two typical triples such that 

(I\K\A^) >- (/2,A2,A2). Then, for any S,W C S,j,0 < j <q = \I^\ = \P\ 
and TOi, 1 < TOi < gj = |Aj| there exists a m 2 , 1 < m 2 < q^ = l^|l; such that 
Com(lnt(/^, A^, A^, j, TOi, S, kL)) Com(lnt(/^, A^, A^, j, TO 2 , S, VL)). 

A full set for an introduce node: We will now consider the case where Xi 
is an introduce node. Clearly Vi = Vi-\ U {a:} where x ^ Vi-\. Suppose that 
Ex = {ei, ■ ■ ■ ,et},0 < t < |Ai_i| < I is the set of edges incident to x in Gi 
(notice that, UeGs^e C Xi). If A^, = 0, then, we simply set FS{i) = FS{i — 1). 
What remains is to examine the case where \Ex\ > 1. 

We define = (V{Gi), A(Gi_i)U{ei, . . . , Cp}), 0 < p < t. Clearly, FS{i — l) 
is a full set of characteristics for G° = Gi_i. Notice also that Gi = G\. Suppose 
that we have a full set of characteristics FS{i,p—l) for Gf~^, 1 < p < t (which is 
the case when p = 1). It is sufficient to give an 0(1) time algorithm constructing 
a full set of characteristics ES{i,p) for G^. In what follows W = CpC A(G(') (i.e. 
the vertices of the inserted edge that are pendant in G^). 

Algorithm Introduce-edge 

Input: A full set of characteristics ES{i,p— 1) for Gf~^. 

Output: A full set of characteristics FS{i,p) for G^. 

1: Initialise FS{i,p) = 0. 

2: For each characteristic {I, A, A) G FS{i,p— 1) where / = (/q, ■ ■ ■ ,Iq) and 
A = (Ao, . . . , Aq), apply step 3. 

3: For any set Ij G I let Aj = {a\, . . . , a^^.) and apply step 4. 

4: For any a^, 1 < to < in Aj, 

set (/', A',A') = Com(lnt(/', A', A', j, TO, Cp, IF)) and 
if max(A') < k, then set FS{i,p) ^ FS{i,p) U {(/', A', A')}. 

5: end. 

The proof of the following is based on Lemmata 2 and 3. 

Lemma 4. The set FS{i,p) constructed by algorithm Introduce-edge is a full 
set of characteristics. 

A full set for a forget node: We will now consider the case where Xi is a 
forget node. Clearly, Gi = Gi-i and there exists a unique vertex v G Ai_i with 
V ^ Xi. We call this vertex v forgotten. Given a full set of characteristics E{i— 1) 
for Xi-i, the algorithm Forget-Vertex computes a full set of characteristics E{i) 
for Xi. 
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Algorithm Forget- Vertex 

Input: A full set of characteristics FS{i — 1) for Gi-\ and a forgotten vertex 

X. 

Output: A full set of characteristics FS{i) for Gj. 

1: Initialise FS{i) = 0. 

2: For any (/, K,A) G FS(i — 1) set FS{i) ^ FS{i) U {Com(/ — {x}, K — 
{x},A)}. 

3: end. 



Lemma 5. The set FS{i) constructed by the algorithm Forget- Vertex is a full 
set of characteristics. 

The decision algorithm: Using the algorithms of the previous sections we can 
compute a full set of characteristics for G\, G 2 , Gz, ■ ■ ■ ,G\x\ = G (obviously 
E{Gi) = 0). Notice that if a graph consists of a single edge Cgtart = {Tgtart) "^start}) 
its full set of characteristics is ({^'start, ^'startl): ((0)))- Using this 

full set of characteristics as a starting point we can use the procedures of the 
previous sections to compute the full sets for G 2 ,Gs, etc., in order. Note that 
the computation needs 0(1) time per node of the path decomposition, and thus 
in total, time linear on the number of vertices of G. After the full set for the last 
node has been computed, in 0(1) time one can decide whether the linear-width 
of G is at most k, as this holds if and only if this last full set AS'(|A|) is not 
empty. We conclude with the algorithm Check-Linear-width. It is possible to turn 
this algorithm to a constructive one (see [6] for the details) . 

Algorithm Check-Linear-width(G, A, k). 

Input: A graph G, a path decomposition A of G with width I, and an integer 

k. 

Output: Whether the linear-width of (G) is at most k. 

1: Let Fo = {((), (),())}. 

2: For any i, I < i < |A|, 

compute a full set of Ai-characteristics Fi for Gi using Fi_i and depending 
on whether Xi is an introduce or a forget node. 

3: Output Fr yf 0. 

4: End. 

We conclude to the following. 

Theorem 1. For all k,l > 1 there exists an algorithm that, given a graph G and 
a path decomposition A = {Xi, 1 < i < \X\) of G with width at most I, computes 
whether the linear-width of G is at most k and, if so, constructs an edge ordering 
of G with linear-width at most k and that uses 0{V{G) -F |A|) time. 

The results presented so far can be trivially extended to graphs with parallel 
edges. In such a case, we should consider the complexity of the algorithm in 
Theorem 1 to be 0(|F(G)| -F |A|). 
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4 The Consequences of Our Algorithm 

In this section we define several search game parameters and we present their 
relations with linear-width. Using these relations we conclude that there exist 
for any fixed k, linear time algorithms that check whether given graphs can be 
mixed, node, or edge searched with at most k searchers, and if so, output the 
corresponding search strategies. 

A mixed searching game is defined in terms of a graph representing a system 
of tunnels where an omniscient and agile fugitive with unbounded speed is hidden 
(alternatively, we can formulate the same problem considering that the tunnels 
are contaminated by some poisonous gas). The object of the game is to clear all 
edges, using one or more searchers. An edge of the graph is cleared if one of the 
following cases occur: A: both of its endpoints are occupied by a searcher, B: a 
searcher slides along it, i.e., a searcher is moved from one endpoint of the edge 
to the other endpoint. 

A search is a sequence containing some of the following moves, a: place a new 
searcher on a vertex, b: remove a searcher from a vertex, c: slide a searcher, 
residing on some of the endpoints of an edge e, along e and place it on the 
other endpoint of e. The object of a mixed search is to clear all edges using 
a search. The search number of a search is the maximum number of searchers 
on the graph during any move. The mixed search number, ms(G), of a graph 
G is the minimum search number over all the possible searches of it. A move 
causes recontamination of an edge if it causes the appearance of a path from an 
uncleared edge to this edge not containing any searchers on its vertices or its 
edges. (Recontaminated edges must be cleared again.) A search without recon- 
tamination is called monotone. The node (edge) search number, ns(G) (es(G)) is 
defined similarly to the mixed search number with the difference that an edge 
can be cleared only if A (B) happens. Parts (a)-(e) of the following result were 
proved (or follow easily) by Bienstock and Seymour in [2] (see also [24]). The 
last part is proven in [25]. 

Theorem 2. For any graph G the following hold: 

(a) If ms (G) < k then there exist a monotone mixed search in G using at most 
k searchers. 

(b) linear-width(G) < ms(G). 

(c) If G® is the graph occurring from G after subdividing each of it edges, then 
es(G) = ms(G®). 

(d) If G" is the graph occurring if we replace every edge in G with two edges in 
parallel, then ns(G) = ms(G”). 

(e) If G^ is the graph occurring from G after subdividing each of its pendant 
edges, then ms(G) = linear- width(G^). 

The result of Theorem 1 has several consequences. First, as one can find 
a path decomposition of a graph G with width at most I, if existing, in lin- 
ear time ([3,4], but see also below) and using the fact that pathwidth(G) < 
linear- width (G) we can conclude to the following result. 
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Theorem 3. For all k > 1, there exists an algorithm that, given a graph G, 
eomputes whether the linear-width of G is at most k and, if so, eonstructs an 
edge ordering of G with linear-width at most k and that uses 0{V{G)) time. 

Using now Theorem 2 , we obtain the following. 

Theorem 4. For all k, I, there exists an algorithm, that given a graph G and a 
path decomposition X = {Xi, 1 < z < |X|) of G with width at most I, computes 
whether the mixed search number (edge search number; node search number) of 
G is at most k, and if so, constructs a mixed search (edge search; node search) 
that clears G with most k searchers, and that uses at most 0{\V (G)| + |^|) time. 



Using small modifications of techniques from [3] , the result above can be used 
to obtain an alternative (but strongly related) proof for the result from [3,4] that 
for each fixed k, the problem to determine whether a given graph has pathwidth 
at most k, and if so, to find a path decomposition of width at most k, has a 
linear time parameterized algorithm. Using this fact, we obtain from Theorem 
4 the following. 

Theorem 5. For all k, there exists an algorithm, that given a graph G, computes 
whether the mixed search number (edge search number; node search number) of 
G is at most k, and if so, constructs a monotone mixed search (edge search; node 
search) that clears G with most k searchers, and that uses at most 0(|U(G)|) 
time. 
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Abstract. We introduce a notion of fixed-parameter tractability per- 
mitting a parameter dependence of only . We delve into the corre- 

sponding intractability theory. In this course we define the PW-hierarchy, 
provide some complete problems, and characterize each of its classes in 
terms of model-checking problems, in terms of propositional satisfiability 
problems, in terms of logical definitions, and by machine models. 

We also locate the complexity of some model-checking problems that are 
fixed-parameter tractable but not boundedly so. 



1 Introduction 

The aim of complexity theory can be described as a classification of combinatorial 
problems into tractable ones and intractable ones. The research of complexity 
theory consists in large parts of charting specific problems. A smaller but not 
less important part is the evolution of the underlying concepts of tractability 
and intractability. For example the notion of fixed-parameter tractability (FPT) 
from parameterized complexity theory can be motivated by several aspects, in 
which the unparameterized theory around polynomial time (P) deviates from 
practical reality. However, the introduction of FPT was (intentionally) simplistic, 
permitting arbitrary, or arbitrary computable, dependence of the running time 
on the parameter. In this context general techniques such as Courcelle’s Theorem 
or the Well-Quasi-Ordering Principle spawn tractability results that cannot be 
transfered into remotely practical algorithms. 

The remedy is, of course, to introduce bounds on the parameter dependence. 
There are, however, several possibilities to do so. In [I] the case was ex- 
plored; despite all its merits it is less robust than might be desirable. Here we 
will investigate the case It will turn out, that this choice retains most 

of FPT’s robustness, while new structure appears. 

We will be concerned mainly with intractability exploration. We identify 
classes that are analogues of the W-hierarchy. However our PW-hierarchy has 
additional intermediate classes. After identifying some complete problems we 
present characterizations of all its classes in terms of propositional logic, first- 
order logic, and machine models. Additionally we locate the problems from [2], 
that initiated the examination of bounded parameterized complexity, in our new 
framework. Indeed they are complete for PAW [*] and PPA; the former is our 
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analogue of AW [*], the latter is a class the unbounded analogue of which has 
not yet been investigated to the author’s knowledge. 

Space limitations force us to defer many proofs to the full version. 

2 Basic Notions 

2.1 Parameterized Complexity 

Let E be an alphabet with at least two characters. We will assume all encodings 
to be made in A. A parameterized (decision) problem is a pair P = (Q, k), where 
Q C A* is a classical decision problem and k : A* ^ N is a parameterization. 
Parameterized problems will usually be stated in the following form: 



Input: A structure 21 and a formula tp 
Question: Does ^\= hold? 

Parameter: \ip\ 



This notation assumes some encoding of structures and of formulas. Here Q is 
the set of encodings of pairs (21, pi), such that 21 is a structure, p \s & formula, 
and %\= p does hold. The parameterization k is: 

J \p\ , if a; is of the form (21, p) 

^ ^ ( 0 otherwise 

The default value 0 is arbitrary. 

Let IF be a set of functions N ^ N. Then the class IF-FPT consists of those 
parameterized problems {Q,k), for which the question ‘x € Q?’ can be decided 
in deterministic time /(«:(a;)) • |a:|'^ for some f & P and c S N. The usual notion 
of fixed-parameter tractability is 

FPT := {/ : N ^ N I / computable}-FPT . 

Here we will be concerned with the stricter notion 



PPT := -PF-FPT , 



where 






rS:=\k 



ceN 



(polynomially exponential functions). 

Let P = (Q,k) and P' = {Q',k') be parameterized problems. A many-one 
reduction R from Q to Q' is called 



1. an fpt-reduction from P to P', if there are computable functions / and 
g and a natural number c, such that R{x) is computable from x in time 
f{K{x)) ■ and such that k'{R{x)) < g{K{x)) for all but finitely many 
a; G A*. 
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2. a ppt-reduction from P to P' , if there are natural numbers c, d, and e, 

such that R{x) is computable from x in time • |a;|'^, and such that 

k'{R{x)) G {K{x)y + (log that is, for all natural numbers / there is 

a natural number g such that k'{R{x)) < {k{x)Y + g ■ (log \x\)f for all but 
finitely many x G E*. 

3. a simple ppt-reduction from P to P', if there are natural numbers c, d, and 

e, such that R{x) is computable from x in time and such that 

k'{R{x)) < {K{x)y for all but finitely many x G E*. 



The natural properties hold: fpt-reductions preserve pertainment to FPT, ppt- 
reductions preserve pertainment to PPT, and simple ppt-reductions are ppt- 
reductions. Furthermore simple ppt-reductions are fpt-reductions. It should be 
noted that the only results, that require non-simple ppt-reductions, are Propo- 
sition 12 and Proposition 13. In these cases n'{R{x)) G 0{{n{x)Y -I- loglog |a:|). 
For a G {f, p} we write P <“p* P' if an apt-reduction from P to P' exists. We 
write P P' if both P <“p* P' and P' <°‘P^ P. Finally we define the class 



[P]“P‘ := {P' 



p! ^apt p| 



2.2 Propositional Logic 

Propositional formulas are built from propositional variables by conjunction, dis- 
junction, and negation. Let Prop denote the class of all propositional formulas. A 
propositional literal is a propositional variable (atom) or a negated propositional 
variable. 

For natural numbers t > 0 and d > 1 we define the classes Pt,d, ^t,d Q Prop 
by recursion over t\ 

Eo,d ■= {Ai A • • • A As I s < d and all Xi are propositional literals} 

XXo,d ■= {Ai V • • • V As I s < d and all Xi are propositional literals} 
Rt+i,d ■= {A-^ I n C At^d} 

At+i,d ■■= {y n I iicri,4 

For P C Prop define F'*', P~ C P as follows: A formula a G P belongs to F'*', 
if all atoms in a occur positively and a belongs to P~ , if all atoms in a occur 
negatively. 

For F C Prop let p-WSAT(F) (parameterized weighted satisfiability prob- 
lem) be the following parameterized problem: 



Input: A formula a G F and fc G N 
Question: Does a have a satisfying assignment that sets exactly 
k variables to TRUE? 

Parameter: k 
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Often the following problem p-PSAT(_T) (parameterized partitioned satisfiability 
problem) is more convenient: 



Input: A formula a G F and a partition {Xi)icick of var(o;) 
Question: Does a have a satisfying assignment that sets exactly 
one variable from each to TRUE? 

Parameter: k (the number of sets in the partition) 



2.3 First-Order Logic 

Some aquaintance with first-order logic is assumed, we will only fix some con- 
ventions. We restrict ourselves to relational vocabularies. First-order atoms are 
of the form Rx\ . . .Xr, where the xt are first-order variables and the relation 
symbol R has arity r. First-order formulas are built from first-order atoms by 
conjunction, disjunction, negation, and existential and universal quantification. 
Let t be a natural number. A first-order formula if pertains to St if 

1. t is odd and ip is of the form 

ip = 3a;i,i . . . a;i,fciVa:2,i . . . X2M ■ ■ ■ 

with quantifier free or 

2. t is even and p is of the form 

p = 3x1,1 ■ • ■ Vx2,i . . . X2,k2 ■ ■ ■ Vxt,i . . . xt,kti> 
with quantifier free ■0. 

For a natural number u such a p pertains to if all but the first quantifier 
block are of length at most u, that is, if ki < u for all i > 2. p furthermore 
pertains to S*t^u, if 

1. t is odd and 0 is a conjunction of literals, or 

2. t is even and 0 is a disjunction of literals. 

p pertains to lit, if the formula obtained from p by replacing 3 by V and vice 
versa pertains to St. If L is any of these logics, define L~^ to be the fragment in 
which no atom occurs negatively and L~ to be the fragment in which no atom 
occurs positively. 

Let T = {i?i, . . . , be a relational vocabulary, the arity of Ri being r^. A 
T-structure is a tuple 21 = (A, Rf , . . . , Rf), where each Rf C A^\ The arity of 
2t is max n. The size 1211 of 2t is defined as 

Ki<s 



|2t| := |A| + |r| + ^ r, . \Rf 

l<i<s 



and only polynomially deviates from the length of a reasonable encoding of 21. 
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Let L be a logic and C be a class of structures. We present two parame- 
terizations of the model-checking problem for L on C, denoted p-MC{C, L) and 

p-MCvar(C,L): 



Input: A structure 21 G C and a formula tp G L 
Question: Does 2t ^ 9 ? hold? 

Parameter: |<p| 



Input: A structure 21 € C and a formula tp € L 
Question: Does 2t ^ (p hold? 

Parameter: |var((p)| 

For the first case \(p\ is defined as the number of nodes in a syntax tree for ip. 
This is the length of (p in a rich enough alphabet. We will simply write p-MC(L), 
if C is the class of all structures, p-MC(s, L) for s S N, if C is the class of all 
structures of arity at most s, and likewise use p-MCvar(L) and p-MCvar(s, L)- 



3 The PW-Hierarchy 

The notions and < pp * are incomparable (For one direction consider problems 
P G FPT \ PPT and P' € PPT, with P' being non-trivial. Then P <^p* P' but 
p ^ppt p' Por the other direction consider an unparameterized problem Q that 
is undecidable and the parameterizations k : x 0 and k' : x log log | a: |. 
Then (Q,k) <pp*^ {Q,k'), but (Q,k) ^^p* (Q, k')). At first glance it would then 
seem, that the theory of lower bounds has to be built up from scratch. It turns 
out, on the other hand, that many completeness results from the FPT-theory 
(that is, statements of the form P =^ p * P') carry over to the PPT-theory (that 
is, P =PP* P' also holds, often even using the same reductions). We will therefore 
introduce all notions for lower bounds as analogously to the ones from the FPT- 
theory as possible. 

For natural numbers t > 1 we set 

PW[t] := U [p-WSAT(r*,d)]PP‘ 

d>l 

as an analogue of W [t] . Many normalization properties that are well-known 
from FPT-theory still hold. In order to save case distinctions we introduce a 
short notation. For even t we define (— )* to be -I- while for odd t we define (— )* 
to be — . 



Lemma 1. For t,d> 1, such that t >2 or d> 2, the problems p-WSAT(Ti_d)> 
p-WSAT ^ ^ p-PSAT(ri_d), and p-PS AT (^P^ J ^ are PW [t]-complete. 

The proofs are refinements of those for the W-hierarchy. 
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There is, however, one completeness missing. The problem p-WSAT 
is W [t]-complete (see [3]). The analogue does not seem to hold for ppt-reductions. 
We therefore introduce further classes into our hierarchy. The new classes are 



PW 



t + 



p-WSAT 



(ctr.) 



I ppt 



for natural numbers t > 1. We retain some of the above normalizations: 

Lemma 2. For t,d > 1 the problems p-WSAT and p-PSAT 

are PW [t + i] -complete. 



The proof can be found in this paper’s full version. 

Since C ^ C Ft+ip, the following is clear now: 

Corollary 3. PW [1] C PW [1.5] C PW [2] C PW [2.5] C . . . 



We conjecture the strictness of these inclusions, but a proof, even for a single 
one, would imply P yf NP. 



3.1 Some Complete Problems 

We define five parameterized problems p-CLIQUE, p-ACC (short turing machine 
acceptance), p-HIS (hypergraph independent set), p-DS (dominating set), and 
p-HDS (hypergraph disconnecting set): 



Input: A graph Q and fc G N 
Question: Does Q have a clique of size fc? 
Parameter: k 



Input: A nondeterministic turing machine M and fc G N 
Question: Does M have an accepting run of length k? 
Parameter: k 



Input: A hypergraph H and A: G N 
Question: Does H have an independent set of size fc? 
Parameter: k 



Input: A graph Q and A: G N 
Question: Does Q have a dominating set of size k? 
Parameter: k 



Input: A hypergraph H and A: G N 
Question: Does H have a disconnecting set of size kl 
Parameter: k 
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An independent set of a hypergraph {V, E) is a set Vb C V, such that e C Vg for 
no e G if . A disconnecting set of {V, E) is a set Vg C V, such that for all e G if 
there is some e' G if with e 0 e' C Vg (e' is disconnected from e by removing the 
vertices Vg). 

By variations of the proofs for the W-hierarchy one obtains: 

Proposition 4. p-CLIQUE and p-ACC are PW [1] -complete. p-DS is PW[2]- 
complete. 

The following are new: 

Proposition 5. p-HIS is PW [1.5] -compfete. p-HDS is PW [2. 5]-complete. 

Proof. For p-HIS <pp* p-WSAT(il 2 ~i) let a hypergraph H = {V,E) be given. 
We introduce propositional variables A„ for all v G V. Then truth assignments 
for these variables correspond in a natural way to subsets of V. That such a 
subset is an independent set is expressed by 

a := A V ■ 

eGiS 

Hence {H, k) pertains to p-HIS if and only if {a, k) pertains to p-WSAT. 

For the converse note that the formula just constructed was generic for Fj"!- 
Indeed, let 

“ = A V 

i&I jeJi 

be an arbitrary F 2 ~i“fopmula. Then with V := var(a) and 
E := {{X, I j G Ji} I z G /} , 

a describes the independent set property for H := (V, if). Hence (a, k) pertains 
to p-WSAT if and only if {H, k) pertains to p-HIS. 

For p-HDS <PP* p-WSAT we can proceed analogously to p-HIS <pp* 
p-WSAT (^ 2 “^) above with 



a:= A V A •’f. ■ 

e^E e'^E v^eDe' 

For p-WSAT (r'b'ji) <pp‘ p-HDS let (a,k) be given, say 

“ = A VAt . 

iei jeJi 

Set A := var(a). Without loss of generality we may assume |A| > k and all Ji 
to be disjoint. We construct a hypergraph H = (F, if) as follows: V consists of 
vertices vx for A G A, of vertices Wi^h for z G / and 1 < A < fc -I- 1, and of 
vertices Uh for l<h<k+l. E consists of a dummy hyperedge 



d := {uh I 1 < A < fc -I- 1} , 
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of hyperedges 



e* 



[ Vx 


xg\J X,] 


[ 


j(^Ji ) 



U {wi^h 



l<h<k+l}Ud 



for is/, and of hyperedges 



fj ■= {vx I X G Xj} U {w./i I i' & I \ {i}, 1 < < fc + 1} 



for i G / and j G Ji- What would a disconnecting set of size (at most) k 

look like? d D fj = 0, hence d and all fj are disconnected from some other 

hyperedge by any set of vertices, d f] et D d and Cj fl e^/ 3 d, hence a set 
of size at most k can disconnect ei at most from some fj. If j ^ Ji, then 

Ci n fj 3 {wi^h I 1 < < fc + 1}, hence can be disconnected at most from 

those fj with j G Ji- In this case Ci C\ fj = {wx | X G Xj}. Now H has a 
disconnecting set of exact size k, if and only if H has a disconnecting set of size 
at most k, if and only if there is some Vq Q {vx \ X G X} oi size at most k such 
that for all i G / there is a j G Ji such that vx G Vq for all X G Xj, if and only 
if a has a satisfying assignment that sets at most k variables to TRUE, if and 
only if a has a satisfying assignment that sets exactly k variables to TRUE. □ 



Interestingly, if the sizes of hyperedges are bounded, then p-HDS G P. This 
is due to the fact that every hyperedge is a disconnecting set. 



3.2 Characterizations by Model-Checking 

The following result carries over from the FPT-world (see [4]): 

Proposition 6. For t,u > 1 and s >2 the parameterized problems p-MC(Ut_„) 
and p-MC(s, are PW [f\-complete. 

This can be extended to: 

Proposition 7. For t > 1 and u,s > 2 the problems p-MCvar(s, and 

p-MCvar^s, J ^ are PW [t]-complete. 

The new result for our intermediate classes reads: 

Proposition 8. For t,u > 1 the parameterized problems p-MCvar (£'**,«) and 
p-MCvar J ^ are PW [t + -complete. 

The proofs of the latter two can be found in this paper’s full version. 
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3.3 Characterizations by Fagin-Definitions 

Let T be a first-order vocabulary and ip an FO [T]-formula possibly containing 
a second-order variable Y . For simplicity of notation let Y be monadic. The 
parameterized problem FD<^ (Fagin-defined by p) is the following: 



Input: A r-structure 21 and fc G N 
Question: Is there some B C A with \B\ = k, such that p\B]l 
Parameter: k 



Proposition 9. Let t > 1 be a natural number. 

1. PW[t]= U U 

2. PW[t+^]= U [FD^]'’"’*- 

The proof can be found in this paper’s full version. 



3.4 Characterizations by a Machine Model 

The machine model AQRAM (alternating query RAM) is an extension of the 
RAAI model. It is somewhat similar to the W-RAM-model of [5]. Where the 
latter introduces instructions for querying binary relations in matrix represen- 
tation in one step, our AQRAM has more natural instructions that allow for 
querying arbitrary relations in list representation in as many steps as a normal 
RAM would need. 

An AQRAM has access to three sets of registers: 

1. The standard registers s(0), s(l), s(2), 

2. The data registers d(0), d(l), d(2), 

3. The index registers i(0), f(l), f(2), 

Data and index registers are called nonstandard registers. The AQRAM inher- 
its the usual (deterministic) instructions from the RAM. These are performed 
solely on standard registers. In addition the AQRAM may perform the following 
instructions: 

1. D-EXISTS t j- Guess a value for a data register. At this point the program 
branches existentially. There is one continuation for each 0 < n < s(0), and 
in this continuation the register d(s(j)) is set to n. 

2. D-FORALL | j: Like D-EXISTS "f j, but the branching is universal. 

3. I-EXISTS t J and I-FORALL "f j: Like the preceeding, but these affect 
i{s{j)) instead of d{s{j)). 

4. IS-JEQUAL h,'\ j,m: Conditional jump to instruction m. The condition 
is s(s(0) -Ff(s(/i))) = s(j). 
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5. ID-ASSUME ttt TT j- Check the condition s(s(0) + i{s{h))) = d{s{j)). If 
it holds, continue, if it does not hold, stop and reject. 

6. ID-COASSUME tTT TT j- Check the condition s(s(0) + z(s(A-))) = d(s(j)). 
If it holds, continue, if it does not hold, stop and accept. 

The new conditional instructions justify the name of the new registers. A stan- 
dard register, which is indexed by an index register (with a base displacement of 
s(0)) is queried. It is compared with another standard register, or with a data 
register. 

For t G N and a parameterization k : A* ^ N an AQRAM program iT is a 
PW [t + i] -program with respect to k, if there are c,d,u G N, such that for all 
but finitely many inputs w and all paths tt in the run of 7T on w the following 
requirements hold: 

1. |7t| < • |w|C 

2. At most 2^”^™)) • |w|'^ registers are used. 

3. No register ever contains a value greater than ■ |w|'^. 

4. There are at most (k{w)) steps after the first step that is not deterministic. 

5. There are at most t alternations of branching, starting existential, and only 
the first block may contain more than u branching instructions. 

6. This first block does not contain any I-EXISTS-instructions. 

7. For odd t no ID-ASSUME-instructions are used; for even t no ID-COASSUME- 
instructions are used. 

77 is a PW [t]-program with respect to k, if furthermore there is some bound h, 
such that no path of a run of 77 contains more than h ID-ASSUME-instructions 
or ID-COASSUME-instructions. 

Proposition 10. Let t > 1 be such that 2t G N. Then a parameterized problem 
(Q, k) pertains to PW [t] if and only if Q is decided by some AQRAM -program 
that is a PW [t]-program with respect to k. 

The proof can be found in this paper’s full version. 

4 PAW [*] and PPA 

A well-known result from unbounded parameterized complexity theory is 

p-MC(Words,MSO) G FPT 

(for example it is implicit in [6]). On the other hand [2] implies 

p-MC ( Words, FO) ^ PPT 

(unless FPT = AW [*]). In this section we identify the complexity of both prob- 
lems with respect to PPT. The former will be an analogue of AW [*]. 
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If T is a class of propositional formulas, then p-AWSAT(T) is the following 
problem: 

Input: a € r, a, partition of var(o;), and weights 

i^i)l<i<q 

Question: Is there a subset 3^1 C Xi with |3fi| = ki, such that 
for all subsets 3^2 C X2 with 13^21 = ^2 there is a 
subset 3^3 C A3 with 13^31 = k^, . . . , such that the 
truth assignment, that sets exactly those variables to 
TRUE, that occur in some 3^*, satisfies a? 

Parameter: ^ ki 

l<i<q 

We also consider the special case p-APSAT(_T), in which ki = 1 for all i: 

Input: a G r and a partition {Xi)icick of var(a) 

Question: Is there some Ai G Xi, such that for all X2 G X2 
there is some A3 G A3, . . . , such that the truth as- 
signment, that sets exactly the variables Ai, . . . , A^ 
to TRUE, satisfies a? 

Parameter: k 

By dropping all weight restrictions we obtain p-QBF(_T): 

Input: a G r and a partition of var(a) 

Question: Is there a subset 3^1 U Ai, such that for all subsets 
3^2 C X2 there is a subset 3^3 C A3, ... , such that the 
truth assignment, that sets exactly those variables to 
TRUE, that occur in some 3^1, satisfies a? 

Parameter: k 

Define PAW [*] as IJ [p-AWSAT(ri_d)]*’'’* and PPA (pure alternation, or, if you 

t,d> 1 

prefer, parameterized by alternation) as [p-QBF(Prop)]*’^*. Note that PPA yf 
PPT unless P = NP, because SAT is a slice of p-QBF(Prop). 

Lemma 11. Let t,d> 1. 

1 . If t >2 or d> 2 , then the parameterized problems p-AWSAT(ri_d) and 
p-APSAT(Tt_d) are PAW [*]-compfete. 

2 . If t > 2 or d > 3 , then the parameterized problem p-QBF(ri_rf) is PPA- 
complete. 

Proposition 12. T/ie proWems p-MC(Words, FO) and p-MC(FO) are eomplete 
/or PAW [*]. 

Proposition 13. T/ie proWems p-MC(Words, MSO) andp-MC(MSO) are com- 
plete for PPA. 

The proofs can be found in this paper’s full version. 
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5 Further Remarks 

The results depicted here are part of the author’s doctorate research. His forth- 
coming dissertation [7] contains further material, especially a treatment of the 
PA-hierarchy, which is the PPT-analogue of the A-hierarchy. 
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Abstract. Memorisation is a technique which allows to speed up expo- 
nential recursive algorithms at the cost of an exponential space complex- 
ity. This techniqne already leads to the currently fastest algorithm for 
fixed-parameter vertex cover, whose time complexity is 0(1.2832*^fc^ ® -|- 
kn), where n is the nnmber of nodes and k is the size of the vertex cover. 
Via a refined use of memorisation, we obtain a 0(1.2759*^fc^ ® -I- kn) al- 
gorithm for the same problem. We moreover show how to further reduce 
the complexity to 0(1.2745*^ -I- kn). 



1 Introduction 

A vertex cover of an undirected graph G is a subset C of nodes such that any 
edge is incident on at least one node in C. The vertex cover problem consists in 
deciding whether G admits a vertex cover of at most k nodes. 

It is well known [6] that vertex cover is fixed-parameter tractable, that is it 
can be solved in 0(/(/c)n“) steps, where n is the number of nodes, / is a function 
of the parameter k only, and a is a constant. A lot of effort was recently devoted 
to develop faster and faster fixed-parameter algorithms for vertex cover. The first 
fixed-parameter algorithm for vertex cover is the 0(2^n) algorithm of Fellows [8] 
(see also [9]), which is based on a bounded search tree strategy. Buss and Gold- 
smith [2] proposed a 0(2^fc^^+^ -b kn) algorithm, in which they introduced the 
kernel reduction technique. Gombining their bounded search tree strategy with 
the kernel reduction of Buss and Goldsmith, Downey and Fellows [5] obtained a 
0{2^k‘^ -\-kn) algorithm. The complexity was later reduced to 0(1.3248^fc^ -b fcn) 
by Balasubramanian, Fellows and Raman [1], to 0(1.2918^/c^ -b kn) by Nieder- 
meier and Rossmanith [10], and to O(1.2906^fc -b kn) by Downey, Fellows and 
Stege [7]. The current fastest polynomial-space algorithm is the 0(1.2852^fc-bA:n) 
algorithm of Ghen, Kanj and Jia [3]. Thanks to the interleaving technique of Nie- 
dermeier and Rossmanith [11], it is possible to get rid of the polynomial factor 
in the exponential term of the complexity. For example the complexity of the 
algorithm of Ghen et al. can be reduced to 0(1.2852^ -b fcn). It is worth to notice 
that, though such polynomial factor is not relevant from the asymptotic point of 
view (the base of the exponential factor is overestimated), it is usually indicated. 
The reason is that the value of k is assumed to be not “too” big. 

Memorisation is a technique developed by Robson [14,15] in the context 
of maximum independent set, which allows to reduce the time complexity of 
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many exponential time recursive algorithms at the cost of an exponential space 
complexity. The key-idea behind memorisation is that, if the same subproblem 
appears many times, it may be convenient to store its solution instead of recom- 
puting such solution from scratch. With this technique Robson [15] managed 
to derive a 0(1.1889") exponential-space algorithm for maximum independent 
set from his own 0(1.2025") polynomial-space algorithm for the same problem. 
Memorisation cannot be applied to the algorithm of Chen et ah, since their 
algorithm branches on subproblems involving graphs which are not induced sub- 
graphs of the kernel. Niedermeier and Rossmanith [12,13] applied memorisation 
to their 0(1.2918^/c^ -I- kn) polynomial-space algorithm for vertex cover, thus 
obtaining a 0(1.2832^fc^'® -I- kn) exponential-space algorithm for the same prob- 
lem, which is also the currently fastest algorithm for vertex cover. It is worth to 
notice that the polynomial factor in the exponential term cannot be removed any 
more with the interleaving technique of Niedermeier and Rossmanith. In fact, 
such technique is based on the idea that most of subproblems concern graphs 
with few nodes. This is not the case when memorisation is applied. 

1.1 Our Results 

The kind of memorisation which is currently applied to vertex cover is in some 
sense a weaker version of the technique originally proposed by Robson for max- 
imum independent set. This is mainly due to the structural differences between 
the two problems. In this paper we present a simple technique which allows 
to get rid of these structural differences, thus allowing to apply memorisa- 
tion to vertex cover in its full strength. By applying our refined technique to 
the 0(1.2918^fc^ -I- kn) algorithm of Niedermeier and Rossmanith, we obtain a 
0(1.2759^A:^'®-|-fcn) exponential-space algorithm for vertex cover. With a further 
refined technique, we reduce the complexity to 0(1.2745^fc^ -I- kn). 

2 Preliminaries 

We use standard graph notation as contained for instance in [4]. An undirected 
graph G is a pair (V, E), where R is a set of n nodes and if is a set of m pairs 
of distinct nodes {edges). The order of G is n while its size is n + m. Two nodes 
V and w are adjacent if {u, w} G E. The edge {u, w} is incident on v and w. 
The set of nodes adjacent to a node v is denoted by N{v). Given a subset W 
of nodes, hy G — W we denote the graph obtained by removing from G all the 
nodes in W and all the edges incident on them. The induced subgraphs of G are 
the graphs of the kind G — W , for any IT C R. A vertex cover of an undirected 
graph G = (R, E) is a subset G of R such that every edge in E is incident on at 
least one node in G. The vertex cover problem consists in deciding whether G 
admits a vertex cover of at most k nodes. A minimum vertex cover is a vertex 
cover of minimum cardinality. From now on we will consider a variant of vertex 
cover problem, in which the size mvc{G) of the minimum vertex covers has to 
be returned, if it does not exceed k. 
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All the currently fastest algorithms for vertex cover are based on two key- 
ideas: kernel reduction and bounded search trees. The idea behind kernel reduc- 
tion is to reduce (in polynomial time) the original problem (G, k) to an equivalent 
problem (G', fc'), where k' < k and the size of G', the kernel, is a function of k 
only. Buss and Goldsmith [2] showed how to obtain a kernel with O(fc^) nodes 
and edges in O(fcn) time. Chen, Kanj and Jia [3] showed how to reduce the 
number of nodes in the kernel to at most 2k in 0{m^/n) steps. By combining 
the kernel reductions of Buss and Goldsmith and of Chen et ah, one obtains [3] 
a 0{kn+ k^) algorithm to reduce the original problem to an equivalent problem 
(G', k'), where k' <k and the order of G' is at most 2k' . 

Let us now consider the idea behind bounded search trees. For this purpose, 
let us define a function fpvc{-, •) as follows: 



fpvc{G, k) 



mvc{G) if mvc{G) < fc; 
-|-oo otherwise. 



Given a problem (F, h), the value of fpvc{F, h) can be computed with a recur- 
sive algorithm of the following kind. If /i = 0, fpvc{F, h) is equal to zero if F 
contains no edges and it is equal to -l-oo otherwise. Otherwise, one solves (F, h) 
by branching on a set of subproblems (Fi, h\), (F 2 , ft- 2 ) • ■ • (FI, hh), with b upper 
bounded by a constant and hi < h for each i G {1, 2 ... 6}. These subproblems 
must satisfy the condition: 



fpvc{F,h)= min {h - hi + fpvc{Fi,hi)}. (1) 

ie{l,2...6} 

Thus the solution of the subproblems directly leads to the solution of (F, h) . 

To give an intuition of how subproblems are generated, let us consider a node 
u of F with neighborhood N(v) = {w, u}. Every vertex cover contains v or N(v). 
This means that F admits a vertex cover of size at most h if and only if F — {u} 
admits a vertex cover of size at most h — 1, or F — {w, m} admits a vertex cover 
of size at most h — 2. In other words: 



fpvc{F, h) = minjl -|- fpvc{F — {u}, h— I), 2 -|- fpvc{F — {w, u}, h — 2)}. 

Thus one can branch with the subproblems (F— {u}, h—1) and {F—{w, u}, h—2). 

Let G{h) denote the total number of search paths in the search tree which 
is created to solve a problem (F, h). For h = 0, C{h) = G(0) = 1. Otherwise, let 
(Fi, /ii), (F 2 , / 12 ) ■ • ■ (Fh, hb) be the subproblems generated to solve (F, h). The 
following relation holds: 

b 

C{h)<Y,C{h^). 

The inequality above is satisfied by G{h) = c^, where c is the branching factor of 
the branching considered, that is the (unique [3]) positive root of the polynomial: 

i=l 
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For example the branching factor associated with the branching at {F, h) on 
the subproblems {F — {v},h — 1) and {F — {w,u},h — 2) is the positive root 
c = 1.6180... of the polynomial — x^~'^ — x^~"^ = x^~‘^{x^ — x — 1). 

The branching factor c of the algorithm is the maximum over all the branch- 
ing factors associated to the branchings that the algorithm may execute. By 
induction, one obtains that C{h) is O(c^). This is also a valid upper bound on 
the number of nodes of the search tree. 

The idea behind bounded search trees is to exploit the structure of the graph 
such as to obtain a branching factor c as small as possible. For this purpose, a 
tedious case analysis is often required. 

Combining the recursive algorithm above with the kernel reduction algorithm 
of Chen et ah, one obtains a 0{c^k^ + kn) algorithm for vertex cover, where 
the cost of branching at a given subproblem is 0{k^). This complexity can 
be reduced to 0(c^ -I- kn) with the interleaving technique of Niedermeier and 
Rossmanith [11]. 

Let us now shortly describe how Niedermeier and Rossmanith applied mem- 
orisation to vertex cover [12,13]. Let A be an algorithm of the kind described 
above, with the extra condition that, when it branches at a problem (F, h), the 
corresponding subproblems (Fi,hi) involve graphs Fi which are induced sub- 
graph of F. In particular, let us consider the fastest algorithm which satisfies 
this extra condition, which is the 0{c^k'^ + kn) = 0(1.2918^fc^-|-fcn) algorithm of 
Niedermeier and Rossmanith [10]. From A one derives a new algorithm A! in the 
following way. For any induced subgraph F of the kernel of at most 2ak nodes, 
for a given a S (0, 1), A' solves (by using A) the problem (F, ak) and it stores the 
pair {F, fpvc{F,ak)) in a database. Note that, since F is an induced subgraph 
of the kernel, one can simply store the set of nodes of F instead of F. Then Al 
works in the same way as A, with the following difference. When the parame- 
ter in a subproblem reaches or drops below the value ak, A' performs a kernel 
reduction. This way. A' generates a new subproblem (F,h), where h < ak and 
the order of F is at most 2ak. Thus A' can easily derive the value of fpvc{F, h) 
from the value of fpvc{F, ak) which is stored in the database. 

The cost of solving each subproblem (F,ak) is 0(c“^). The number of 
pairs stored in the database is at most which is 0(fc“°-^(a“(l — 

from Stirling’s approximation. Thus the database can be created 
in 0(c“^/c~*^'®(a“(l — time. The search tree now contains 

nodes (that is the upper bound on C(fc) which is obtained by assuming C{ak) = 
1). The cost associated to each node is O(fc^), not considering the leaves for 
which the costs of the query to the database and of the kernel reduction have to 
be taken into account. In particular, the database can be implemented such as 
that the cost of each query is 0{k). Moreover each kernel reduction costs 0{k^'^) 
(since each graph contains 0{k) nodes and 0(/c^) edges). Thus the cost to create 
the search tree is 0{c^^~°‘^^k^'^). The value of a has to be chosen such as to 
balance the cost of creating the search tree and the database. The optimum is 
reached when a > 0.0262 satisfies: 
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1 — a o 
C = C 



1 



a“(l — Qf)i 



2 



Thus one obtains a + kn) = 0(1.2832^fc^'® + kn) time complexity. 

The complexity can be slightly reduced in the following way. All the nodes 
of degree greater than 6 are filtered out in a preliminary phase (in which the 
algorithm stores no solution in the database and does not perform kernel reduc- 
tions). In particular, let {F, h) be a subproblem where F contains a node v with 
|7V(u)| > 6. The algorithm branches on the subproblems {F — {v},k — 1) and 
(F — N(v), k — |fV(u)|). The number of subproblem generated in the preliminary 
phase is O(c^), where c < 1.256 < c. The cost of branching at these subproblems 
is O(fc^). Thus the total cost to remove “high” degree nodes is 0(1.256^fc^). All 
the subproblems generated after the preliminary phase, involve subgraphs with 
0(fc) nodes and edges. This means that the kernel reductions can be performed 
in 0{k^'^) time only. This way the complexity of the algorithm is reduced to 

0(1.2832'=A:i'5-kA:n)- 



3 Refined Memorisation for Vertex Cover 

In this section we present a refined way to apply memorisation to vertex cover. 
The complexity analysis are based on the 0(c^fc^ -I- kn) = 0(1.2918^fc^ -I- kn) al- 
gorithm of Niedermeier and Rossmanith [10], which is the currently fastest algo- 
rithm for vertex cover which is compatible with memorisation (since it branches 
on subproblems involving only induced subgraphs of the kernel). It is worth to 
mention that the technique proposed can be easily adapted to other algorithms 
of similar structure. 

The rest of this section is organized as follows. In Section (3.1), we 
present a variant of the exponential-space algorithm of Niedermeier and Ross- 
manith [12,13], of complexity 0(1.2829^/c^'® -I- kn). The reduction in the com- 
plexity is achieved via a more efficient use of the database. In Section (3.2), 
we show how to reduce the complexity to 0(1.2759^fc^'® -I- kn) by branching on 
subproblems involving connected induced subgraphs only. In Section (3.3), the 
complexity is further reduced to 0(1.2745^A:^ -I- kn). 



3.1 A More Efficient Use of the Database 

Our algorithm works as follows. No database is created a priori. Then, the al- 
gorithm works as the algorithm of Niedermeier and Rossmanith (as described 
in Section (2)), with the following differences. There is no value /i > 0 of the 
parameter for which the recursion is stopped. Let {F, h) be a problem generated 
after the preliminary phase. Before branching on the corresponding subprob- 
lems (Fi,hi),{F 2 ,h 2 ) . . .{Fh,hb), the algorithm applies a kernel reduction to 
each subproblem (no matter which is the value of the parameters hi). In partic- 
ular, the algorithm branches on a set of subproblems {F{,h[), (E^, . . . (E^, h'^), 

where the order of F' is at most 2/i' {order- condition) . The reason of enforcing 




66 



L. Sunil Chandran and Fabrizio Grandoni 



the order-condition will be clearer in the analysis. Observe that this does not 
modify the branching factor of the algorithm. When {F, h) is solved, the triple 
{F, h, fpvc{F, h)) is stored in a database. In this case also, before solving {F, h), 
the algorithm checks whether the solution is already available in the database. 
This way one ensures that a given subproblem is solved at most once. 

The cost of the preliminary phase (in which “high” degree nodes are removed 
from the kernel) is not modified. The cost associated to each node of the search 
tree, after the preliminary phase, is 0(fc^'®). Let P{h) denote the number of 
subproblems with parameter h (after the preliminary phase). From the analysis 
of Section (2), P{h) is 0(c^“^). The subproblems considered involve induced 
subgraphs of the kernel of order at most 2h. Let N{2k, 2h) denote the number 
of such induced subgraphs. Since no subproblem is solved more than once, P{h) 
is 0(min{c^“^, fV(2fc, 2/i)}). For h > fc/4: 

minjc'^-'*, N{2k, 2h)} < c’^~^ < = 0(1.2118'=). 

For h < fc/4, N{2k,2h) < 2 ( 2 ^). As h decreases from fc/4 to 0, the function 
min{c'=“'®, 2 ( 2 ^)} first increases, then reaches a peak, and eventually decreases. 
The peak is reached for h = ak + 0(1), where a > 0.0271 satisfies: 

^ 

Va“(l - aY~°‘ 

Thus P{h) is 0(c^^~“^'=) = 0(1.2829'=). This is also an upper bound on the num- 
ber of nodes in the search tree. Then the complexity of the algorithm proposed 
is 0(c(i-“)'=fci-5 -b kn) = 0(1.2829'=A:1'5 -b kn). 

Besides the time complexity, an important difference between the algorithm 
proposed in this section and the exponential-space algorithm of Niedermeier 
and Rossmanith is the role played by the parameter a. In the algorithm of 
Niedermeier and Rossmanith, a influences the behavior of the algorithm, and 
thus it has to be fixed carefully. In our algorithm instead, a only appears in the 
complexity analysis. 

3.2 Branching on Connected Induced Subgraphs 

In previous section we described an algorithm A which makes use of a database 
in which it stores triples of the kind (F,h,fpvc{F,h)), where F is an induced 
subgraph of the kernel of order at most 2h (order-condition). The graphs F 
stored in the database may not be connected. We will now show how to derive 
from A an algorithm Ac with the same branching factor as A, which branches 
only on subproblems involving induced subgraphs which are connected (besides 
satisfying the order-condition). 

The difference between Ac and A is the way Ac branches at a given sub- 
problem after the preliminary phase. Let (F, h) be a problem generated after 
the preliminary phase and {Fi,hi), {F 2 ,h 2 )-- - (Ft^hb) be the corresponding 
subproblems generated by A. Let moreover Fi,i,Fi ,2 • ■ ■ Fi,pi be the connected 
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components oi Fi. A naive idea could be to branch on the subproblems {Fij, hi), 
for each i € {1,2 .. .6} and for each j G (1,2 . . .pi} (the graphs Fij are con- 
nected and satisfy the order-condition). In fact, it is not hard to show that: 



fpvc{Fi,ht) 



(Tj if CTj < h^; 
-|-oo otherwise. 



where at is the sum of the solutions returned by the subproblems {Fij, hi): 

Pi 

Oi = y^fpvc{F^j,hj). 
i=i 

Once the values fpvc{Fi, hi) are available, the value of fpvc{F, h) can be easily 
derived from Equation (1). 

Though this approach is correct in principle, it may lead to a bad branching 
factor. The reason is that one may generate many more than b subproblems, 
without decreasing the parameter in each subproblem properly. To avoid this 
problem, one can use the following simple observation. The size of the minimum 
vertex covers of the connected components with less than 6£ -I- 2 nodes, for a 
fixed positive integer i, can be computed in constant time. Thus one does not 
need to branch on them. The size of the minimum vertex covers of the remaining 
connected components is lower bounded by I (since they contain at least 6^ -I- 1 
edges and their degree is upper bounded by 6). This lower bound can be used 
to reduce the parameter in each subproblem. 

In more details, for each connected component FI of Fi with less than 6^ -I- 2 
nodes, the algorithm computes mvc{F[) by brute force, and this value is added 
to a variable Ai (which is initialized to 0). Let Fi^i,Fi ^2 ■ ■ ■ Fi^b, be the remain- 
ing connected components of Fi. The minimum vertex covers of each Fij have 
cardinality at least £. This implies that, if the size of the minimum vertex covers 
of some Fij is greater than hi — {bi — 1)£, then fpvc{Fi,hi) = -l-oo. Thus one 
can replace the subproblem {Fij, hi) with a subproblem {Fij, hi — {bi — 1)1). 
Note that the order-condition is satisfied by the new subproblem since each Fij 
contains at most 2hi — {bi — 1)(6^ -I- 2) nodes. Let cr' be the sum of the solutions 
returned by the subproblems {Fij, hi — {bi — 1)^): 

bi 

cr' = ^ fpvc{F,j,h, - {b^ -1)£). 

i=i 

The value of fpvc{Fi, hi) is given by: 



fpvc{Fi,hi) 



-I- cr' if Ai + a{< hp, 
-|-oo otherwise. 



In this case also one does not need to compute the solutions of the subproblems 
{Fij, hi — {bi — 1)^) which are already available in the database. 
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Let c and cc be the branching factors corresponding to the branching on 
{F, h) of the algorithms A and Ac respectively. We can decompose the branch- 
ing of Ac on {F, h) in 5 -|- 1 branchings. First there is one branching on the 
subproblems (Fi, ft-i), {F^Ai) ■ ■ ■ {Fb, hb). Then, for each subproblem {Fi, hi), 
i G { 1, 2 ... 6 }, there is one branching on the “big” connected components of Ft. 
The first branching has branching factor c. The other branchings have branching 
factors Cl , C 2 . . . C{, respectively, where: 



G = 



1 iibi = 1 ; 

<2^/^ otherwise. 



The value of cc is upper bounded by the maximum over c and ci, 62 . . . C{,. Thus, 
to ensure that the branching factor of Ac is not greater than the branching 
factor c of c = 1.2918, it is sufficient to fix £ such that 2^/^ < c. In particular, 
we can assume ^ = 3. 

Let us now consider the complexity of Ac- The cost of the preliminary 
phase is not modified. We can both filter out small connected components 
and search in the database in 0(k) time (with a careful implementation of 
the database). Thus the cost associated to each branching is the same as in 
A, that is 0(k^'^). Let Rd{m,n) be the number of connected induced sub- 
graphs of order m which are contained in a graph of order n and degree up- 
per bounded by d. Robson [14] showed that Rd{rn,n) is O )• 

In particular, RQ{2h,2k) is 0(^(|J)^^). It turns out that this is also a valid 
upper bound for the number N(2k,2h) of the subproblems concerning graphs 
of order at most 2h which are generated after the preliminary phase. From 
this bound and the analysis of Section (3.1), the time complexity of Ac is 
0(c(i-“)fc/ji-5 _|_ _ 0(1.2759^A:^'® -I- kn), where 



log(c) 

log(c) + 21og(fJ) 



> 0.04867. 



3.3 A Further Refinement 

In Section (3.2) we showed that restricting the class of graphs which are stored in 
the database, without increasing the branching factor, leads to a faster algorithm. 
In particular, the algorithm of Section (3.2) stores connected induced subgraphs 
only (instead of general induced subgraphs). In this section we show how it is 
possible to store only connected induced subgraphs of degree lower bounded by 
2. This leads to a 0(1.2745^fc^ -I- kn) algorithm for vertex cover. 

Nodes of degree zero can be safely removed (since they do not belong to any 
minimum vertex cover). Then we just need to take care of nodes of degree one. 
Let u be a node of degree one, with N{v) = {w}. It is not hard to show that there 
exists a minimum vertex cover which contains w and does not contain v. Since 
we do not need to find all the minimum vertex covers (of a given size), but only 
one, we can simply assume that w is in the vertex cover and remove v and w from 
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the graph. Thus nodes of degree one can be filtered out in linear time. Observe 
that, if one starts with a problem {F, h) which satisfies the order-condition, the 
graph obtained after removing nodes of degree at most one satisfies the order- 
condition too. Moreover this filtering out does not increase the branching factor 
of the algorithm. 

Let i? 2 ,d(w, n) be the number of connected induced subgraphs of order m and 
degree lower bounded by 2 which are contained in a graph of order n and degree 
upper bounded by d. Robson [15] showed that i? 2 ,d(w,n) is 0{r'^ npolyd{m)), 
where, for any fixed d, polydim) is a polynomial of m and is a constant 
which comes from a maximization problem (in particular, tq < 9.927405). Fol- 
lowing the proof of Robson, it is not hard to derive that polyd{m) is 
With a more careful analysis, one obtains that polyd{m) is 0(m“2“). Thus 
R2,e{2h,2k) is 0{rl'^polye{h)k) = O(9.927405^^/i^-®fc). By replacing Re{2h,2k) 
with R2fi{2h,2k) in the analysis of Section (3.2), one obtains: 



log(c) 

log(c) -g 2 log(re) 



> 0.05282, 



and thus a 0(c*-^ -I- kn) = 0(1.2745^fc^ -I- kn) time complexity. 
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Abstract. We consider parameterized problems where some separation 
property has to be achieved by deleting as few vertices as possible. The 
following five problems are studied: delete k vertices such that (a) each 
of the given t terminals is separated from the others, (b) each of the 
given t pairs of terminals are separated, (c) exactly t vertices are cut 
away from the graph, (d) exactly I connected vertices are cut away from 
the graph, (e) the graph is separated into I components. We show that 
if both k and £ are parameters, then (a), (b) and (d) are fixed-parameter 
tractable, while (c) and (e) are W[l]-hard. 



1 Introduction 

In this paper we study five problems where we have to delete vertices from a 
graph to achieve a certain goal. In all four cases, the goal is related to making 
the graph disconnected by deleting as few vertices as possible. 

Classical flow theory gives us a way of deciding in polynomial time whether 
two vertices t\ and can be disconnected by deleting at most k vertices. How- 
ever, for every > 3, if we have £ terminals ti, t^, . . . , ti, then it is NP-hard 
to And k vertices such that no two terminals are in the same component after 
deleting these vertices [3]. In [8] a (2 — 2/^)-approximation algorithm was pre- 
sented for the problem. Here we give an algorithm that is efficient if k is small: 
in Section 2 it is shown that the Minimum terminal separation problem 
is fixed-parameter tractable with parameter k. We also consider the more gen- 
eral Minimum terminal pair separation problem where £ pairs (si,ti), . . . , 
(si,te) are given, and it has to be decided whether there is a set of k vertices 
whose deletion separates each of the £ pairs. We show that this problem is fixed- 
parameter tractable if both k and £ are parameters. Our results can be used in 
the edge deletion versions of these problems as well. 

In Section 3 we consider two separation problems without terminals. In the 
Separating £ Vertices problem exactly £ vertices have to be separated from 
the rest of the graph by deleting at most k vertices. In Separating into £ 
Components problem k vertices have to be deleted such that the remaining 

* Research is supported in part by grants OTKA 44733, 42559 and 42706 of the 
Hungarian National Science Fund. 
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Problem 


1 Parameter(s) | 


k 


£ 


k and £ 


Minimum Terminal Separation 


FPT 

(Theorem 1) 


NP-hard 
for £ > 3 [3] 


FPT 

(Theorem 1) 


Minimum Terminal Pair Separation 


Open 


NP-hard 
for £ > 3 [3] 


FPT 

(Theorem 2) 


Separating 1 Vertices 


W[l]-hard 
(Theorem 4) 


W[l]-hard 
(Theorem 4) 


W[l]-hard 
(Theorem 4) 


Separating £ Connected Vertices 


W[l]-hard 
(Theorem 8) 


W[f]-hard 
(Theorem 7) 


FPT 

(Theorem 6) 


Separating into £ Components 


W[l]-hard 
(Theorem 9) 


W[l]-hard 
(Theorem 9) 


W[l]-hard 
(Theorem 9) 



Table 1. Complexity of the problems with different parameterizations. 



graph has at least i connected components. The edge deletion variants of these 
problems were considered in [5], where it is shown that both problems are W[l]- 
hard with parameter 1. Here we show that the vertex deletion versions of both 
problems are W[l]-hard even if both k and £ are parameters. However, in the 
case of Separating £ Vertices if we restrict the problem to bounded degree 
graphs, then it becomes fixed-parameter tractable if both k and £ are parameters. 
Moreover, we also consider the variant Separating £ Connected Vertices, 
where it is also required that the separated vertices form a connected subgraph. 
It turns out that this problems is fixed-parameter tractable if both k and £ are 
parameters, but W[l]-hard if only one of them is parameter. 

The results of the paper are summarized on Table 1. 

2 Separating Terminals 

The parameterized terminal separation problem studied in this section is for- 
mally defined as follows: 

Minimum Terminal Separation 

Input: A graph G{V, E), a set of terminals T CV, and an integer k. 
Parameter 1: k 
Parameter 2: £ = \T\ 

Question: Is there a set of vertices 5 C V of size at most k such that no 
two vertices of T belong to the same connected component of G \ S'? 

Note that S and T do not have to be disjoint, which means that it is allowed 
to delete terminals. A deleted terminal is considered to be separated from all the 
other terminals (later we will argue that our results remain valid for the slightly 
different problem where the terminals cannot be deleted). 

It follows from the graph minor theory of Robertson and Seymour that Mini- 
mum Terminal Separation is fixed-parameter tractable. The celebrated result 
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of Robertson and Seymour states that graphs are well-quasi ordered with respect 
to the minor relation. Moreover, the same holds for graphs where the edges are 
colored with a fixed number of colors. For every terminal v € T, we add a new 
vertex u' and a red edge vv' (the original edges have color black) . Now separating 
the terminals and separating the red edges are the same problem. Consider the 
set fffc that contains those red-black graphs where the red edges can be separated 
by deleting at most k vertices. It is easy to see that is closed with respect 
to taking minors. Therefore by the Graph Minor Theorem, has a finite set 
of forbidden minors. Another result of Roberson and Seymour states that for 
every graph H there is an 0(|yp) algorithm for finding an iJ-minor, therefore 
membership in can be tested in 0(|yp) time. This means that for every 
k, Minimum Terminal Separation can be solved in 0(|Mp) time, thus the 
problem is (non-uniformly) fixed-parameter tractable. However, the constants 
given by this non-constructive method are incredibly large. In this section we 
give a direct combinatorial algorithm for the problem, which is more efficient. 

The notion of important separator is the most important definition in this 
section: 

Definition 1. Let G{V,E) be a graph. For subsets X,S CV, the set of vertices 
reachable from X \ S in G \ S is denoted by R{S,X). For X,Y ffV, the set 
S is called an (A, T)-separator if Y C\ R{S,X) = 0. An {X^Y)~ separator is 
minimal if none of its proper subsets are {X,Y)~ separators. An {X,Y)~ separator 
S' dominates an {X,Y)~ separator S, if \S'\ < [S'! and R{S,X) C R{S',X). A 
subset S is an important (A, A)-separator if it is minimal, and there is no 
{X ,Y) -separator S' that dominates S. 

Abusing notations, the one element set {u} is denoted by v. We note that A and 
Y can have non-empty intersection, but in this case every (A, A)-separator has 
to contain A n A. 

We use Figure 1 to demonstrate the notion of important separator. Let A = 
{x\ and Y = { 2 / 1 , 2 / 27 ^ 3 , 2 / 4 }, we want separate these two sets. A and Y can be 
separated by deleting x, this is the only separator of size 1. There are several 
size 2 separators, for example {a, /}, {b,g}, {b,j}, |c, j|. However, only |c, jj is 
an important separator: R{{c,j},x) = {x,a,b, f,g,h,i} and the set of vertices 
reachable from x is smaller for the other size 2 separators. There are two size 3 
important separators: {c,k,£} and {j,d,e}. Separator {c,h,i} is not important, 
since it is dominated both by |c, jj and by {c,k,£}. Finally, there is only one 
important size 4 separator, Y itself. 

Testing whether a given (A, A)-separator S is important can be done as 
follows. First, minimality can be easily checked by testing for each vertex s € S 
whether S \ s remains separating. If it is minimal, then for every vertex s G S, 
we test whether there is an {R{S, A) Us, A)-separator S' of size at most [S']. This 
separator can be found in 0(|yp) time using network flow techniques. If there 
is such a separator, then S is not important. Notice that if S is not important, 
then this method can be used to find an important separator that dominates S. 
The test can be repeated for S' , and if it is not important, then we get another 
separator S" that dominates S'. We repeat this as many times as necessary. 
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Since the set of vertices reachable from X increases at each step, eventually we 
arrive to an important separator. 

Let X and Y be two sets of vertices, then there is at most one important 
(X, F)-separator of size 1. A size 1 separator has to be a cut vertex (here we 
ignore the special cases where |A| = 1 or |F| = 1). If there are multiple cut 
vertices that separate X and Y , then there is a unique cut vertex that is farthest 
from X and closest to Y. This vertex will be the only important {X, ^(-separator. 

However, for larger sizes, there can be many important (A, ^(-separators of 
a given size. For an example, see Figure 2. To separate the two large cliques X 
and Y, for each 1 < z < t, either a^, or both bi and Ci have to be deleted. If we 
choose to delete both bi and c,, then we have to delete two vertices instead of 
one, but the set of vertices reachable from X increases, it includes a^. Therefore 
there are (^* 2 ) important (A, ^(-separators of size 3t/2: for t/2 of the z’s we 
delete a^, and for the remaining t/2 we delete bi and Cj. All these separators are 
important, since R{S',X) and R{S”,X) are pairwise incomparable for two such 
separators S' and S" . Thus the number of important separators of a given size 
k can be exponential in k. However, we show that this number is independent 
of the size of the graph: 

Lemma 1. For sets of vertices X,Y, there are at most 4^ important {X,Y)~ 
separators of size k. Moreover, these separators can be enumerated in polynomial 
time per separator. 

Proof. The proof is by induction on k. We have seen above that the statement 
holds for A: = 1. Let S be an important (A, ^(-separator of size k in G. We count 
how many other important separators can be in G. If H is another important 
(A, ^(-separator of size k, then we consider two cases depending on whether 
Z = S D H is empty or not. If Z is not empty, then it is easy to see that H\Z 
is an important {X\Z,Y\ ^(-separator in G\Z. Since \H\Z\ < k, thus by the 
induction hypotheses the number of such separators is at most . There 

are not more than 2^ possibilities for the set Z, and for each set Z there are at 
most 4^^“^) possibilities for the set H, hence the total number of different H 
that intersect S is at most 2^4^^“^)^. 
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X 



Fig. 2. A graph where there is an exponential number of important separators 
that separate the large cliques X and Y. 



Next we count those separators that do not intersect S. Such a separator H 
contains £ vertices from R{S, X) and k—i vertices from R{S, Y). It is not possible 
that £ = 0: that would imply that R{S, X) U 5 C R{H, X) and S would not be 
an important separator. Here we used the minimality of S: if none of R{S,X) 
and S is deleted, then every vertex of S can be reached from X. Similarly, it 
is not possible that £ = k because H would not be an important separator in 
that case. To see this, notice that by the minimality of S, from every vertex of 
S a vertex of Y can be reached using only the vertices in R{S,Y). Therefore 
no vertex of S can be reached from X in G \ H , otherwise H would not be an 
{X, F)-separator. Since S is an {X, F)-separator, thus this also means that no 
vertex of R{S,Y) can be reached. Therefore R{H,X) is contained in R{S,X), 
and since ^ > 0, the containment is proper. 

We divide H into two parts: let Hi = H H R{S,X) and H 2 = H f] R{S,Y) 
(see Figure 3). The separator S is also divided into two parts: = STl R{H, X) 

contains those vertices that can be reached from X in G\H, while S 2 = S \ Si 
contains those that cannot be reached. Let Gi be the subgraph of G induced by 
R{S,X) U S, and G 2 be the subgraph induced by R{S,Y) U S. Now it is clear 
that Hi is an {X U S'!, 52)-separator in Gi, and H 2 is a (S'!, F U S' 2 )-separator in 
G 2 . Moreover, we claim that they are important separators. First, if Hi is not 
minimal, i.e., it remains an (XUS'!, S' 2 )-separator without v G Hi, then H would 
be an (X, F)-separator without v as well. Assume therefore that an (XUS'!, S' 2 )- 
separator H* in Gi dominates Hi. In this case H* U H 2 is an (X, F)-separator 
in G with R{H,X) C R{H* U H 2 ,X), contradicting the assumption that H 
is an important separator. A similar argument shows that H 2 is an important 
(S'i,F U 52)-separator in G 2 . By the induction hypotheses, we have a bound on 
the possible number of such separators. For a given division (Si, S 2 ) and £, there 
can be at most 4^ 4(^~^) possibilities. There are at most 2^ possibilities for 
{Si, S 2 ), and the value of £ is between 1 and k — 1. Therefore the total number 
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Fig. 3. Separators in the proof of Lemma 1. 



of different separators (including S itself and the at most 2^4^^ sets in the 
first case) is at most 



fe-i 

1 + ^ < 1 + 2^4^^“^^^ + (fc — 1)2^4^^“^)^+^ 

e=i 

^ ^ 2fc+2 ^ 

what we had to show (in the first inequality we used + (fc — £)'^ < {k — 1)^ + 1, 
which holds since 1 < ^ < fc — 1). The proof also gives an algorithm for finding 
all the important separators. To handle the first case, we take every subset Z 
of S, and recursively find all the important size k — jZj separators in G \ S'. In 
the second case, we consider every 1 < £ < k — 1 and every division (Si,S 2 ) 
of S. We enumerate every important (X U Si, S 2 )-separator Si in Gi and every 
important (Si,F U S 2 )-separator in G 2 . For each Si, S 2 , it has to be checked 
whether Si U S 2 is an important (X, F)-separator. As it was shown above, every 
important separator can be obtained in such a form. Our algorithm makes a 
constant number of recursive calls with smaller k, therefore the running time is 
uniformly polynomial. □ 

What makes important separators important is that a separator in a solution 
can be always replaced by an important separator: 

Lemma 2. If there is a set S of vertices that separates the terminals t\, . . . , 
tr, then there is a set H with \H\ < |S| that also separates the terminals and 
contains an important ({G}, {0, tz, ■ ■ ■ , tr\)-separator. 

Proof. Let Sq C S be those vertices of S that can be reached from ti without go- 
ing through other vertices of S. Clearly, So is a ({ti}, {^ 2 , h,. . . , G})-separator, 
and it contains a minimal separator Si. If Si is important, then we are ready, 
otherwise there is an important ({ti}, {f 2 , ts, ■ • ■ , G})-separator S[ that domi- 
nates Si. We claim that S' = (S \ Si) U S( also separates the terminals. If this 
is true, then |S(| < |Si| implies |S'| < |S|, proving the lemma. 

Since S[ is a ({ti}, {^ 2 , ts, ■ • ■ , G})-separator, thus S' separates ti from all 
the other vertices. Assume therefore that there is a path P in G \ S' connecting 
terminals ti and tj. Since S separates ti and tj, thus this is only possible if P goes 
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through a vertex v of Si. Every vertex of Si C So has a neighbor in R{S,ti), 
let w this neighbor of v. Since R{S,ti) C R{S',ti), vertex w can be reached 
from ti in G \ S' . Therefore U can be reached from ti via w and v, which is a 
contradiction, since S' is a ({ti}, {t 2 , to,. . . , tr})-separator. □ 

Lemma 1 and Lemma 2 allows us to use the method of bounded search trees to 
solve the Minimum Terminal Separation problem: 

Theorem 1. Minimum Terminal Separation is fixed-parameter tractable 
with parameter k. 

Proof. We select an arbitrary terminal t that is not already separated from 
every other terminal. By Lemma 2, there is a solution that contains an important 
{t, T\t)-separator. Using Lemma 1, we enumerate all the at most important 
separators of size at most k, and select a separator S from this list. We delete 
S from G, and recursively solve the problem for G \ S with problem parameter 
A: — [S'!. At each step we can branch into at most directions, and the problem 
parameter is decreased by at least one, hence the search tree has height at most 
k and has at most leaves. The work to be done is polynomial at each step, 
hence the algorithm is uniformly polynomial. □ 

A natural way to generalize Minimum Terminal Separation is to have a 
more complicated restriction on which terminals should be separated. Instead 
of a set of terminals where every terminal has to be separated from every other 
terminal, in the following problem there are pairs of terminals, and every terminal 
has to be separated only from its pair: 

Minimum Terminal Pair Separation 

Input: A graph G{V,E), pairs of vertices (si,ti), ( 32 ,^ 2 ), {si,ti), 
and an integer k. 

Parameter 1: k 
Parameter 2: i 

Question: Is there a set of vertices S' C U of size at most k such that for 
every 1 < i < £, vertices Si and Q are in different components of G \ S? 

Let T = [jl^i{si,ti} be the set of terminals. We can prove an analog of 
Lemma 2: there is an optimal solution containing an important separator. 

Lemma 3. If there is a set S of vertices that separates every pair, then there is 
a set S' with |S'| < |S| that also separates the pairs and S' contains an important 
{{si\,T')- separator for some subset T' C T. 

Proof. We proceed similarly as in the proof of Lemma 2. Let T' be the set of 
those terminals that are separated from si in G \ S. Let Sq C S be the vertices 
reachable from si without going through other vertices of S. Clearly, So is an 
(si, T')-separator, and it contains a minimal (si, T')-separator Si. If Si is not 
important, then there is an important (si, T')-separator S'l that dominates Si. 
We claim that S' = (S\ Si) U S) also separates the pairs. Clearly, ti G T', hence 
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Si and ti are separated in S' . Assume therefore that Si and U are connected by 
a path P in G \ S". As in Lemma 2, path P goes through a vertex of and it 
follows that both Si and U are connected to si in G\S'. Therefore Si,ti yf T' . 
However, this implies that si is connected to Si and U in G\ S, hence S does 
not separate Si from ti, a contradiction. □ 

To find k vertices that separate the pairs, we use the same method as in The- 
orem 1. In Lemma 3, there are 2^ different possibilities for the set T', and by 
Lemma 1, for each T' there are at most different separators of size at most k. 
Therefore we can generate 2^ • /c2^ separators such that one of them is contained 
in an optimum solution. This results in a search tree with at most 2^^ • k^A^ 
leaves. 

Theorem 2. The Minimum Terminal Pair Separation problem is fixed- 
parameter tractable with parameters k and £. □ 

Separating the terminals in T can be expressed as separating pairs, hence 
Minimum Terminal Separation is a special case of Minimum Terminal 
Pair Separation. However, Theorem 2 does not imply Theorem 1. In Theo- 
rem 2 the number of pairs is a parameter, while the size of T can be unbounded 
in Theorem 1. We do not know the complexity of Minimum Terminal Pair 
Separation if only k is the parameter. 

As noted above, in the separation problems we assume that any vertex can 
be deleted, even the terminals themselves. However, we can consider the slightly 
more general problem, when the input contains a set V* of distinguished vertices, 
and these vertices cannot be deleted. All the results in this section hold for this 
variant of the problem as well. In all of the proofs, when a new separator is 
constructed, then it is constructed from vertices that were contained in some 
other separator. 

We can consider the variants of Minimum Terminal Separation and Min- 
imum Terminal Pair Separation where the terminals have to be separated 
by deleting at most k edges. The edge deletion problems received more attention 
in the literature: they were consider in e.g. [4,3,7] under the names multiway cut, 
multiterminal cut, and multicut. As noted in [8], it is easy to reduce the edge 
deletion problem to vertex deletion, therefore our algorithms can be used for 
these edge deletion problems as well. For completeness, we briefly describe a 
possible reduction. The edge deletion problem can be solved by considering the 
line graph (in the line graph L{G) of G the vertices correspond to the edges of 
G, and two vertices are connected if the corresponding two edges have a com- 
mon vertex.) However, we have to do some tinkering before we can define the 
terminals in the line graph. For each terminal Vi of G, add a new vertex p' and 
a new edge Wiw'. Let u' be the terminal instead of Vi. If edge Viv'i is marked 
as unremovable, then this modification does not change the solvability of the 
instance. Now the problem can be solved by using the vertex separation algo- 
rithms (Thereom 1 and 2) on the line graph L{G). The terminals in the line 
graph are the vertices corresponding to the edges Viv'^. These edges were marked 
as unremovable, hence these vertices are contained in the set V* of distinguished 
vertices in the line graph. 
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Theorem 3. The edge deletion versions of Minimum Terminal Separation 
(with parameter k) and Minimum Terminal Pair Separation (with param- 
eters k and I ) are fixed-parameter tractable. □ 

3 Cutting Up a Graph 

Finding a good separator that splits a graph into two parts of approximately 
equal size is a useful algorithmic technique (see [9,10] for classic examples). This 
motivates the study of the following problem, where a given number of vertices 
has to be separated from the rest of the graph: 

Separating t Vertices 

Input: A graph G{V, E), integers k and £. 

Parameter 1: k 
Parameter 2: £ 

Question: Is there a partition V = A U 5 U V such that |A| = £, [S’] < k 
and there is no edge between X and Y7 

It follows from [2] that the problem is NP-hard in general. Moreover, it is 
not difficult to show that the parameterized version of the problem is hard as 
well, even with both parameters: 

Theorem 4. Separating £ Vertices is W[l]-hard with parameters k and £. 

Proof. The proof is by reduction from Maximum Clique. Let G be a graph 
with n vertices and m edges, it has to be determined whether G has a clique 
of size k. We construct G' as follows. In G' there are n vertices vi, . . . , that 
correspond to the vertices of G, these vertices form a clique in G' . Furthermore, 
G' has TO vertices ei, . . . , Cm that correspond to the edges of G. If the end points 
of edge Cj in G are vertices f and Vj .^ , then connect vertex Cj with vertices 
and Vj^ in G'. We set £' = ( 2 ) and k' = k. 

If there is a clique of size k, then we can cut £' vertices by removing k' 
vertices. From vi, . . . , Vn remove those k vertices that correspond to the clique. 
Now the ( 2 ) vertices of G' that correspond to the edges of the clique are isolated 
vertices. On the other hand, assume that £' vertices can be cut by deleting k! 
vertices. The remaining vertices of wi, . . . , form a clique of size greater than 
£' (assuming n> -\-k), hence the P separated vertices correspond to P edges 
of G. These vertices have to be isolated, since they cannot be connected to the 
large clique formed by the remaining vfs. This means that the end vertices of 
the corresponding edges were all deleted. Therefore these P = ( 2 ) edges can 
have at most k' = k end points, which is only possible if the end points induce 
a clique of size fc in G. □ 

If we consider only bounded degree graphs, then Separating £ Vertices 
becomes fixed-parameter tractable: 

Theorem 5. Separating £ Vertices is fixed-parameter tractable with param- 
eters k, £, and d, where d is the maximum degree of the graph. 
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Proof. Consider a solution V = X U S' U F, and consider the subgraph induced 
by X U S. This subgraph consists of some number of connected components, let 
Xi U Si be the vertex set of the ith component. For each i, the pair {Si, Xi) has 
the following two properties: 

(1) in graph G the set Si separates Xi from the rest of the graph, and 

(2) XiU Si induces a connected graph. 

On the other hand, assume that the pairs (Xi,Si), ..., (Xt,St) satisfy (1), 
(2), and the sets Xi, . . . , Xt, Si, . . . , St are pairwise disjoint. In this case if 
X = Xi^J ■ ■ ■ U Xt has size exactly £ and S = SiU ■ ■ ■ U St has size at most k, 
then they form a solution. Therefore we generate all the pairs that satisfy these 
requirements, and use color coding to decide whether there are disjoint pairs 
with the required total size. If there is a solution, then this method will find one. 

By requirement (2) a pair {Xi,Si) induces a connected subgraph of size 
at most k + i. We enumerate each such connected subgraph. If a vertex v is 
contained in a connected subgraph of size at most k + £, then all the vertices of 
the subgraph are at a distance of less than k + £ from v. The maximum degree 
of the graph is d, thus there are at most vertices at distance less than k + £ 
from V. Therefore the number of connected subgraphs that contain v and have 
size at most k + £ is a constant, which means that there is a linear number of 
such subgraphs in the whole graph. We can enumerate these subgraphs in linear 
time. Each subgraph can be divided into a pair {Xi, Si) in at most 2^+^ different 
ways. From these pairs we retain only those that satisfy requirement (1). 

Having generated all the possible pairs (Xi,S'i), ..., {Xp,Sp), a solution 
can be found as follows. We consider a random coloring of the vertices with 
c := k + £ colors. Using dynamic programming, we try to find a solution where 
every vertex of X U S' has a distinct color. Subproblem {C ,j, k' , £') asks whether 
it is possible to select some pairs from the first j pairs such that (a) they are 
pairwise disjoint, (b) they use only vertices with color C , (c) the union of the Sfs 
has size k! , and (d) the union of the Xfs has size £' . For f = 0, the subproblems 
are trivial. If the subproblems for j — 1 are solved, then the problem can be 
solved for f using the following two recurrence relations. First, if subproblem 
{C ,j — l, k' ,£') is true, then clearly {C',j, k' ,£') is true as well. Moreover, if every 
vertex of Xj U Sj has distinct color (denote by Cj these colors), and subproblem 
{C \ Cj,j — l,k' — \Sj\,£' — l^jj) is true, then a solution for this subproblem 
can be extended by the pair {Xj,Sj) to obtain a solution for {C',j, k' ,£'). Using 
these two rules, all the subproblems can be solved. 

If there is a solution XUS', then by probability at least (where c = k + £ 
is the number of colors) these vertices receive distinct colors, and the algorithm 
described above finds a solution. Therefore if there is a solution, then on average 
we have to repeat the method d/d. (constant) times to find a solution. The 
algorithm can be derandomized using the standard method of /c-perfect hash 
functions, see [6, Section 8.3] and [1]. □ 

A variant of Separating £ Vertices is the Separating £ Connegted 
Vertices problem where we also require that X induces a connected subgraph 
of G. This problem is fixed-parameter tractable: 
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Theorem 6. The Separating I Connected Vertices problems is fixed- 
parameter tractable with parameters k and £. 

Proof. A vertex with degree at most k £ will be called a low degree vertex, let 
Go be the subgraph induced by these vertices. A vertex v with degree more than 
k -\- £ cannot be part of A: at most k neighbors of v can be in S, hence v would 
have more than £ neighbors in X, which is impossible if |A| = £. Therefore 
A is a connected subgraph of Gq. As in the proof of Theorem 5, a bounded 
degree graph has a linear number of connected subgraphs of size £. For each 
such subgraph, it has to be checked whether it can be separated from the rest 
of the graph by deleting at most k vertices. □ 

However, if only k is parameter, then the problem is W[l]-hard. This follows 
from the proof of Theorem 4. We construct the n + m vertex graph as before, 
but instead of asking whether it is possible to separate ( 2 ) vertices by deleting 
k vertices, we ask whether it is possible to separate n m — — k connected 

vertices by deleting k vertices. The two questions have the same answer, thus 

Theorem 7. Separating £ Connected Vertices is W[l]-hard with param- 
eter k. □ 

Similarly, the problem is W[l]-hard if only £ is the parameter. 

Theorem 8. Separating £ Connected Vertices is W[l]-hard with param- 
eter £. 

Proof. The reduction is from Maximum Clique. It is not difficult to show that 
Maximum Clique remains W[l]-hard for regular graphs. Assume that we are 
given an r-regular graph G, and it has to be decided whether there is a clique of 
size k. If r < k^, then the problem is fixed parameter tractable: for every vertex 
V, we select k — 1 neighbors of v in at most possible ways, and test whether 

these k vertices form a clique. Thus it will be assumed that r > k"^. 

Consider the line graph L(G) of G, i.e., the vertices of L{G) correspond to 
the edges of G. Set £ = ( 2 ) and k' = k{r — fc + 1). If G has a size k clique 
then the £ edges induced by the clique can be separated from the rest of the line 
graph: for each vertex of the clique, we have to delete the r — k-\-l edges leaving 
the clique. On the other hand, assume that £ vertices of G' can be separated 
by deleting k vertices. The corresponding £ edges in G span a set T of vertices 
of size t < 2£. We show that t = k, thus T is a clique of size fc in G. Assume 
that t > k. Each vertex of T has at least r — t + 1 edges that leave T. The 
corresponding t{r — t + 1) vertices have to be deleted from the line graph of G, 
hence k' > t{r — t + 1). However, this is not possible since 

t{r — t -\- 1) — k' = {t — k)r — t{t — 1) + k{k — 1) > (t — k)r — > r — k'^ > 0 

(in the first inequality we use 4£^ > m the second t > k and £< k"^ / 2). □ 
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The vertex connectivity is the minimum number of vertices that has to be 
deleted to make the graph disconnected. Using network flow techniques, vertex 
connectivity can be determined in polynomial time. By essentially the same proof 
as in Theorem 4, we can show hardness for this problem as well: 

Separating into £ Components 

Input: A graph G{V, E), integers k and £ 

Parameter 1: k 
Parameter 2: 1 

Question: Is there a set S' of A: vertices such that G \ S has at least £ 
connected components? 

Theorem 9. Separating into £ Components is W[l]-hard with parameters 
k and £. 

Proof. The construction is the same as in Theorem 4, but this time we set 
£' = ( 2 ) + 1 E^nd k' = k. By deleting the vertices corresponding to a clique of 
size k the graph is separated into £' components. The converse is also easy to 
see, the argument is the same as in Theorem 4. □ 
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Abstract. In the precoloring extension problem (PrExt) a graph is 
given with some of the vertices having a preassigned color and it has to be 
decided whether this coloring can be extended to a proper coloring of the 
graph with the given number of colors. Two parameterized versions of the 
problem are studied in the paper: either the number of precolored vertices 
or the number of colors used in the precoloring is restricted to be at most 
k. We show that these problems are polynomial time solvable but W[l]- 
hard in chordal graphs. For a graph class , let ke (resp. ^ + kv) 
denote those graphs that can be made to be a member of by deleting 
at most k edges (resp. vertices). We investigate the connection between 
PrExt in and the coloring of + ke, ^ + ve graphs. Answering 
an open question of Leizhen Cai [5], we show that coloring chordal+fce 
graphs is fixed-parameter tractable. 



1 Introduction 

In graph vertex coloring we have to assign colors to the vertices such that neigh- 
boring vertices receive different colors. In the precoloring extension (PrExt) 
problem a subset W of the vertices have a preassigned color and we have to ex- 
tend this to a proper ^-coloring of the whole graph. Since vertex coloring is the 
special case when W = %, the precoloring extension problem is NP-complete in 
every class of graphs where vertex coloring is NP-complete. See [2,7,8] for more 
background and results on PrExt. 

In this paper we study the precoloring extension problem on chordal graphs. 
PrExt is NP-complete for interval graphs [2] (and for unit interval graphs [12]), 
hence it is NP-complete for chordal graphs as well. On the other hand, if every 
color is used only once in the precoloring (this special case is called 1 -PrExt), 
then the problem becomes polynomial time solvable for interval graphs [2] , and 
more generally, for chordal graphs [11]. Here we introduce two new restricted ver- 
sions of PrExt: we investigate the complexity of the problem when either there 
are only k precolored vertices, or there are only k colors used in the precoloring. 

* Research is supported in part by grants OTKA 44733, 42559 and 42706 of the 
Hungarian National Science Fund. 
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Clearly, the former is a special case of the latter. By giving an 0{kn^'^^) time 
algorithm, we show that for fixed k both problems are polynomial time solvable 
on chordal graphs. However, we cannot expect to find a uniformly polynomial 
time algorithm for these problems, since they are W[l]-hard even for interval 
graphs. To establish W[l]-hardness, we use the recent result of Slivkins [15] that 
the edge-disjoint paths problem is W[l]-hard. 

Leizhen Cai [5] introduced a whole new family of parameterized problems. 
If ^ is an arbitrary class of graphs, then denote hy ^ — kv (resp. ^ — ke) the 
class of those graphs that can be obtained from a member of ^ by deleting at 
most k vertices (resp. k edges) . Similarly, let ^ + kv (resp. ^ -I- ke) be the class 
of those graphs that can be made to be a member of ^ by deleting at most k 
vertices (resp. k edges). For any class of graphs ^ and for any graph problem, 
we can ask what is the complexity of the problem restricted to these ’almost 
graphs. This question is investigated in [5] for the vertex coloring problem. 
Coloring ^ + kv or + ke graphs can be very different than coloring graphs in 
and might involve significantly new approaches. 

We investigate the relations between PrExt and the coloring of the modified 
graph classes. We show that for several reasonable graph classes, reductions 
are possible between PrExt for graphs in ^ and the coloring oi ,!^ + kv or 
.^ + ke graphs. Based on this correspondence between the problems, we show 
that both chordal-kfce and chordal-|-fcw graphs can be colored in polynomial time 
for fixed k, but chordal-|-fcf graph coloring is W[l]-hard. Moreover, answering 
an open question of Cai [5], we develop a uniformly polynomial time algorithm 
for coloring chordal-|-fce graphs. 

The paper is organized as follows. Section 2 contains preliminary notions. 
Section 3 reviews tree decomposition, which will be our main tool when dealing 
with chordal graphs. In Section 4, we investigate the parameterized PrExt 
problems for chordal graphs. The connections between PrExt and coloring 
ke, ^ + kv graphs are investigated in Section 5. Finally, in Section 6, we show 
that coloring chordal-f/ce graphs is fixed-parameter tractable. 



2 Preliminaries 

A C-coloring is a proper coloring of the vertices with color set C. We introduce 
two different parameterization of the precoloring extension problem. Formally, 
the problem is as follows: 

Precoloring Extension (PrExt) 

Input: A graph G{V, E), a set of colors C, and a precoloring W ^ C 
for a set of vertices W C V. 

Parameter 1: \ W\, the number of precolored vertices. 

Parameter 2: \{tp{w) \ w & W}\ = \Cw\, the number of colors used in 
the precoloring. 

Question: Is there a proper C-coloring ip' of G that extends ip (i.e., 
ip'{w) = tp{w) for every w G IF)? 
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Note that Cw Q C is the set of colors appearing on the precolored vertices, 
and can be much smaller than the set of available colors C. When we consider 
parameter 1, then the problem will be called PrExt with fixed number of pre- 
colored vertices, while considering parameter 2 corresponds to PrExt with fixed 
number of colors in the precoloring. 

For every class ^ and every fixed k, one can ask what is the complexity 
of vertex coloring on the four classes ^ + ke, ^ + kv, ^ — ke, ^ — kv. The 
first question is whether the problem is NP-complete for some fixed k. If the 
problem is solvable in polynomial time for every fixed k, then the next question 
is whether the problem is fixed-parameter tractable, that is, whether there is a 
uniformly polynomial time algorithm for the given classes. 

If ^ is hereditary with respect to taking induced subgraphs, then — kv 
is the same as hence coloring ^ — kv graphs is the same as coloring in ^ . 
Moreover, it is shown in [5] that if ^ is closed under edge contraction and has 
a polynomial time algorithm for coloring, then coloring ^ — ke graphs is fixed 
parameter tractable. Therefore we can conclude that coloring chordal— /cn and 
chordal— A:e graphs are in FPT. In this paper we show that coloring chordal-|-fce 
graphs is in FPT, but coloring chordal-b/cu graphs is W[l]-hard. 

The modulator of an ^ -I- ke graph G is a set of at most k edges whose 
removal makes G a member of Similar definitions apply for the other classes. 
We will call the vertices and edges of the modulator special edges and vertices. 
In the case of ^ -I- e and ^ — e graphs, the vertices incident to the special edges 
are the special vertices. 

When considering the complexity of coloring in a given parameterized class, 
then we can assume either that only the graph is given in the input, or that a 
modulator is also given. In the case of coloring chordal— fee graphs, this makes no 
difference as finding the modulator of such a graph (i.e., the at most k edges that 
can make the graph chordal) is in FPT [4,9] . On the other hand, the parameter- 
ized complexity of finding the modulator of a chordal-|-fce graph is open. Thus 
in our algorithm for coloring chordal-|-fce graphs, we assume that the modulator 
is given in the input. 



3 Tree Decomposition 

A graph is chordal if it does not contain a cycle of length greater than 3 as 
an induced subgraph. This section summarizes some well-known properties of 
chordal graphs. First, chordal graphs can be also characterized as the intersection 
graphs of subtrees of a tree (see e.g. [6]): 

Theorem 1. The following two statements are equivalent: 

1. G{V,E) is chordal. 

2. There exists a tree T{U,F) and a subtree T^ C T for each v GV such that 
u,v GV are neighbors in G{V, E) if and only z/ T„ n T„ yf 0. 
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Fig. 1. Nice tree decomposition of a chordal graph. 



The tree T together with the subtrees T„ is called the tree decomposition of G. 
A tree decomposition of G can be found in polynomial time (see [6,14]). 

We assume that T is a rooted tree with some root r G U. For clarity, we 
will use the word ’vertex’ when we refer to the graph G{V,E), and ’node’ when 
referring to T{U,F). For a node x G U, denote by Vx those vertices whose 
subtree contains a; or a descendant of x. The subgraph of G induced by Vx will 
be denoted by Gx = G[Fe]. For a node x GU oiT, denote by Kx the union of u’s 
where x G V{Ty). Clearly, the vertices of Kx are in Vx, and they form a clique 
in Gx, since the corresponding trees intersect in T at node x. An important 
property of the tree decomposition is the following: for every node x G U , the 
clique Kx separates 14 \ Kx and V\Vx- That is, among the vertices of 14, only 
the vertices in Kx can be adjacent to F \ 14- 

A tree decomposition will be called nice [10], if it satisfies the following 
additional requirements (see Figure 1): 

— Every node x G U has at most two children. 

— li X GU has two children y,z G U, then Kx = Ky = K^ {x is a join node). 

— li X GU has only one child y GU, then either Kx = KyU {v} (a; is an add 
node) or Kx = Ky\ {u} {x is a forget node) for some v GV . 

— li X G U has no children, then Kx contains exactly one vertex {x is a leaf 
node). 

By splitting the nodes of the tree in an appropriate way, a tree decomposition 
of G can be transformed into a nice tree decomposition in polynomial time. 

A vertex v can have multiple add nodes, but at most one forget node (the 
vertices in clique K^ of the root r have no forget nodes, but every other vertex 
has exactly one) . For a vertex v, its subtree T„ is the subtree rooted at the forget 
node of v (if it exists, otherwise at the root) and whose leaves are exactly the 
add nodes and leaf nodes of v. 

4 PrExt on Chordal Graphs 

In this section we show that PrExt can be solved in polynomial time for chordal 
graphs if the number of colors used in the precoloring is bounded by a constant 
k. The algorithm presented below is a straightforward application of the tree 
decomposition described in Section 3. The running time of the algorithm is 
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hence it is not uniformly polynomial. However, in Theorem 3 it is 
shown that the problem is W[l]-hard, hence we cannot hope to find a uniformly 
polynomial algorithm. 

Theorem 2. The PrExt problem can be solved in 0{kn^^^) time for chordal 
graphs, if the number of colors in the precoloring is at most k. 

Proof. It can be assumed that the colors used in the precoloring are the colors 1, 
2, k. For each node x of the nice tree decomposition of the graph, we solve 
several subproblems using dynamic programming. Each subproblem is described 
by a vector [«i, . . . , ak], where each Oj is either a vertex of K^, or the symbol *. 
We say that such a vector is feasible for node x, if there is a precoloring extension 
for Gx with the following property: if Oi (1 < z < fc) is *, then color i does not 
appear on the clique Kx, otherwise it appears on vertex Oj C Kx- Notice that 
in a feasible vector a vertex can appear at most once (but the star can appear 
several times), thus in the following we consider only such vectors. 

Clearly, the precoloring can be extended to the whole graph if and only if the 
the root node r has at least one feasible vector. The algorithm finds the feasible 
vectors for each node of T. We construct the feasible vectors for the nodes in a 
bottom-up fashion. First, they are easy to determine for the leaves. Moreover, 
they can be constructed for an arbitrary node if the feasible vectors for the 
children are already available. The techniques are standard, details omitted. □ 

To prove that PrExt with fixed number of precolored vertices is W[l]-hard 
for interval graphs, we use reduction from the edge disjoint paths problem, which 
is the following: 

Edge disjoint paths 

Input: A directed graph G{V, E), with k pairs of vertices (si,ti). 

Parameter: The number of pairs k. 

Question: Is there a set of k pairwise edge disjoint directed paths Pi, 

. . . , Pk such that path Pi goes from Si to til 

Recently, Slivkins [15] proved that the edge disjoint paths problem is W[l]- 
hard for directed acyclic graphs. 

Theorem 3. PrExt with fixed number of precolored vertices is W[l]-/iard for 
interval graphs. 

Proof. The proof is by a parameterized reduction from the directed acyclic edge 
disjoint path problem. Given a directed acyclic graph G(V, E) and terminal pairs 
Si, ti {I < i < k), we construct an interval graph with k' = 2k precolored vertices 
in such a way that the interval graph has a precoloring extension if and only if 
the disjoint paths problem can be solved. Let 1, 2, . . . , n be the vertices of G in 
a topological ordering. For each edge of G we add an interval [x, y). For each 
terminal pair Si, U we add two intervals [0, Si) and [ti,n+l), and precolor these 
intervals with color z. 
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Denote by ^(x) the number of intervals whose right end point is x (i.e., the 
intervals that arrive to x from the left), and by r{x) the number of intervals 
whose left end point is x. In other words, ^{x) is the number of edges entering x 
plus the number of demands starting in x. If l{x) < r{x), then add r{x) — £{x) 
new intervals [0, x) to the graph, if £{x) > r{x), then add £{x)—r{x) new intervals 
[x, n+ 1). A consequence of this is that each point of [0, n+ 1) is contained in the 
same number (denote it by c) of intervals: at each point the number of intervals 
ending equals the number of intervals starting. We claim that the interval graph 
has a precoloring extension with c colors if and only if the disjoint paths problem 
has a solution. 

Assume first that there are k disjoint paths joining the terminal pairs. For 
each edge xy, if it is used by the ith terminal pair, then color the interval [x, y) 
with color i. Notice that the intervals we colored with color i do not intersect 
each other, and their union is exactly [sj, U). Therefore, considering also the two 
intervals [0, Si) and [si,n+ 1) precolored with color i, each point of [0, n + 1) is 
covered by exactly one interval with color i. Therefore each point is contained 
in exactly c — k intervals that do not have a color yet. This means that the 
uncolored intervals induce an interval graph where every point is in exactly c—k 
intervals, and it is well-known that such an interval graph has clique number 
c — k and can be colored with c—k colors. Therefore the precoloring can be 
extended using c—k colors in addition to the k colors used in the precoloring. 

Now assume that the precoloring can be extended using c colors. Since each 
point in the interval [0,n -|- 1) is covered by exactly c intervals, therefore each 
point is covered by an interval of color i. Thus if an interval with color i ends at 
point x, then an interval with color i has to start at x. Since the interval [0, Si) 
has color i, there has to be an interval [sj, Sip) with color i. Similarly, there has to 
be an interval [sjp, with color i, etc. Continuing this way, we will eventually 
arrive to an interval [spp,ti). By the way the intervals were constructed, the 
edges SiSpi, sTpsTp, ..., Si^pti form a path from Si to ti. It is clear that the 
paths for different values of i are disjoint since each interval has only one color. 
Thus we constructed a solution to the disjoint paths problem, as required. □ 



5 Reductions 

In this section we give reductions between PrExt on ^ and coloring ^ + kv, 
J^+ke graphs. It turns out that if ^ is closed under disjoint union and attaching 
pendant vertices, then 

coloring + ke graphs ^ PrExt on with fixed |IE| ^ 
coloring ^ + kv graphs ^ PrExt on ^ with fixed \Cw\ 

When coloring ^ + ke or ^ + kv graphs, we assume that the modulator of the 
graph is given in the input. The proof of the following four results will appear 
in the full version: 
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Theorem 4. For every class ^ of graphs, coloring ^+ke graphs can be reduced 
to PrExt with fixed number of precolored vertices, if the modulator of the graph 
is given in the input. 



Theorem 5. Let be a class of graphs closed under attaching pendant vertices. 
Coloring + kv graphs can be reduced to PrExt with fixed number of colors 
in the precoloring, if the modulator of the graph is given in the input. 



Theorem 6. If is a hereditary graph class closed under disjoint union, then 
PrExt in with fixed number of precolored vertices can be reduced to the 
coloring of ^ + kv graphs. 



Theorem 7. If is a hereditary graph class closed under joining graphs at a 
vertex, then PrExt on ^ with a fixed number of colors in the precoloring can 
be reduced to the coloring of + kv graphs. 

When reducing the coloring of ^ + fce or ^ + fcw graphs to PrExt, the 
idea is to consider each possible coloring of the special vertices and solve each 
possibility as a PrExt problem. In the other direction, we use the special edges 
and vertices to build gadgets that force the precolored vertices to the required 
colors. 

Concerning chordal graphs, putting together Theorems 2-6 gives 

Corollary 1. Coloring chordal+ke and chordal+kv graphs can be done in poly- 
nomial time for fixed k. However, coloring interval-\-kv (hence chordal-\-kv) 
graphs is W[l]-hard. □ 

In Section 6, we improve on this result by showing that coloring chordal+fce 
graphs is fixed-parameter tractable. 



6 Coloring Chordal+fce Graphs 

In Section 5 we have seen that coloring a chordal-|-A:e graph can be reduced to the 
solution of PrExt problems on a chordal graph, and by Theorem 2, each such 
problem can be solved in polynomial time. Therefore chordal-l-fce graphs can be 
colored in polynomial time for fixed k, but with this algorithm the exponent 
of n in the running time depends on k. In this section we prove that color- 
ing chordal-|-A:e graphs is fixed-parameter tractable by presenting a uniformly 
polynomial time algorithm for the problem. 

Let H he & chordal-l-fce graph, and denote by G the chordal graph obtained 
by deleting the special edges of G. We proceed similarly as in Theorem 2. First 
we construct a nice tree decomposition of H . A subgraph Gx of G corresponds 
to each node x of the nice tree decomposition. Let Hx be the graph Gx plus the 
special edges induced by the vertex set of Gx. For each subgraph Hx, we try to 
find a proper coloring. In fact, for every node x we solve several subproblems: 
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each subproblem corresponds to finding a coloring of with a given property 
(to be defined later). The main idea of the algorithm is that the number of 
subproblems considered at a node can be reduced to a function of k. 

Before presenting the algorithm, we introduce a technical tool that will be 
useful. For each node x of the nice tree decomposition, the graph H* is defined by 
adding a clique of \C\ — \K^\ vertices u\, U 2 , • ■ • , u\c\-\Ko;\ to the graph and 
connecting each new vertex to each vertex of K^- The clique together with 
the new vertices form a clique of size |C|, this clique will be called K*. Instead 
of the colorings of we will consider the colorings of H*. Although H* is a 
supergraph of Hx, it is not more difficult to color than Hx- the new vertices are 
only connected to Kx, hence in every coloring of Hx there remains |C| — \Kx\ 
colors from C to color these vertices. However, considering the colorings of H* 
instead of the colorings of Hx will make the arguments cleaner. The reason for 
this is that in every C-coloring of H* every color of C appears on the clique K* 
exactly once, which makes the description of the colorings more uniform. 

Another technical trick is that we will assume that every special vertex is 
contained in exactly one special edge (recall that a vertex is called special if it 
is the end point of a special edge.) A graph can be transformed to such a form 
without changing the chromatic number, details omitted. The idea is to replace 
a special vertex with multiple vertices, and add some simple gadgets that force 
these vertices to have the same color. Since each special vertex is contained in 
only one special edge, thus each special vertex w has a unique pair, which is the 
other vertex of the special edge incident to w. 

Now we define the subproblems associated with node x. A set system is 
defined where each set corresponds to a type of coloring that is possible on H*. 
Let W be the set of special vertices, we have \W\ < 2k. Let Wx be the special 
vertices contained in the subgraph H*. In the following, we consider sets over 
K* X W. That is, each element of the set is a pair {v, w) with v £ K*, w £W. 

Definition 1. To each C-coloring tp of H*, we associate a set Sx^ip) C K* x W 
such that (v,w) £ Sx^tp) (v £ Kf, w £ Wx) if and only if ip{v) = ip{w). The set 
system T^x over Kf x W contains a set S if and only if there is a coloring ip of 
Hf such that S = Sx{ip). 

The set Sx{ip) describes ip on Hf as it is seen from the “outside”, i.e., from 
H \ Hf. In Hf only Kf and Wx are connected to the outside. Since AT* is a 
clique of size |C|, every color appears on exactly one vertex, this is the same for 
every coloring. Seen from the outside, the only difference between the colorings 
is how the colors are assigned to Wx- The set Sx{ip) captures this information. 

Subgraph Hf (hence Hx) is C-colorable if and only if the set system d^x is 
not empty. Therefore to decide the C-colorability of H, we have to check whether 
d/'x is empty, where r is the root of the nice tree decomposition. 

Before proceeding further, we need some new definitions. 

Definition 2. A set S' C Kf x W is regular, if for every w £ W, there is 
at most one element of the form (v,w) in S. Moreover, we also require that if 
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V G K* n W then (v, v) G S. The set S contains vertex w, if there is an element 
{v, w) in S for some v G K*. 

For a coloring ip of Hf, set Sxiip) is regular and contains only vertices from Wx- 



Definition 3. For a set S G K* x W, its blocker B{S) is a subset of K* x W 
such that {v,w) G B{S) if and only if{v,w') G S for the pair w' ofw. We say that 
sets Si and S 2 form a non-blocking pair if B{Si) n S '2 = 0 and Si n B{S 2 ) = 0- 

If 1/1 is a coloring of H*, then the set B{Sx{tp)) describes the requirements that 
have to be satisfied if we want to extend ip to the whole graph. For example, if 
{v, w) G Sx{ip), then this means that v G Kf has the same color as special vertex 
w. Now {v,w') G B{Sx{ip)) for the pair w' of w. This tells us that we should not 
color w' with the same color as v, because in this case the pairs w and w' would 
have the same color. 

To be a non-blocking pair, it is sufficient that one of B{Si)r\S 2 and Sir\B{S 2 ) 
is empty: 

Lemma 1. For two sets Si, S 2 G Kx x W , we have that B{Si) n S '2 = 0 i/ and 
only if Si n B{S2) = 0. 

Proof. Suppose that B(Si)nS 2 = 0, but (v,w) G SiDB{S2) (the other direction 
follows by symmetry). Since {v,w) G B{S 2 ), this means that {v,w') G S 2 where 
w' is the pair of w. But in this case (v,w) G Si implies that {v,w') G B{Si), 
contradicting B{Si) n S 2 = 0. □ 

The following lemma motivates the definition of the non-blocking pair, it 
turns out to be very relevant to our problem. If a; is a join node, then we can 
give a new characterization of dP^x, based on the set systems of its children. 

Lemma 2. If x is a join node with children y and z, then 

= {SyVI Sz '■ Sy G SFy and Sz G S^z form a non-blocking pair}. 

Proof. If S G S^x, then there is a corresponding coloring ip of H*. Coloring ip 
induces a coloring ipy (resp. ipz) of H* (resp. Hp). Let Sy (resp. Sz) be the set 
that corresponds to coloring ipy (resp. ipz). We show that Sy and Sz form a 
non-blocking pair, and S = Sy U Sz. By Lemma 1, it is enough to show that 
Syr\B{Sz) = 0. Suppose that Syr\B{Sz) contains the element {v, w) for some v G 
K* = Kp and w G Wy. By the definition of Sy, this means that ipy{v) = ipy{w). 
Since (v,w) G B{Sz), thus {v,w') G Sz for the pair w' G IF of w. Therefore 
'f’ziv) = 'f’ziw') follows. However, ipy{v) = ipz{v), hence ipy{w) = ipz{w'), which 
is a contradiction, since w and w' are neighbors, and ip is a. proper coloring of H*. 
Now we show that S = SyU Sz. It is clear that {v,w) G Sy implies {v,w) G S, 
hence SyUSz C S. Moreover, suppose that {v, w) G S. Without loss of generality, 
it can be assumed that w is contained in H*. This implies that (v,w) G Sy, as 
required. 
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Now let Sy G y'y and Sz G be a non-blocking pair, it has to be shown 
that S = Sy U Sz is in Let ipy (resp. ^pz) be the coloring corresponding 
to Sy (resp. Sz)- In general, ipy and tpz might assign different colors to the 
vertices of K* = K* = K*. However, since K* is a clique and every color 
appears exactly once on it, by permuting the colors in ipy, we can ensure that ipy 
and ipz agree on K*. We claim that if we merge i/'y and 'ipz, then the resulting 
coloring tjj is a, proper coloring of H*. The only thing that has to be verified is 
whether ip assigns different colors to the end vertices of these special edges that 
are contained completely neither in H* nor iJ*. Suppose that special vertices 
w G Wy \ Wz and w' G Wz \ Wy are pairs, but ip{w) = tp{w'). We know that 
(v,w) G Sy for some v G K*, and similarly (v\w') G Sz- By definition, this 
means that tpyiv) = ipyiw) and ipzW) = tpiw')- Since ^py and ipz assign the 
same colors to the vertices of the clique K*, thus this is only possible ii v = v' , 
implying (v,w') G Sz- However, B{Sy) also contains {v,w') contradicting the 
assumption that B{Sy)(^Sz = 0- Now it is straightforward to verify that the set 
corresponding to ip is S = SyU Sz^ proving that S G □ 

Lemma 2 gives us a way to obtain the system if a; is a join node and the 
systems for the children are known. It can be shown for add nodes and forget 
nodes as well that their set systems can be constructed if the set systems are 
given for their children. However, we do not prove this here, since this observation 
does not lead to a uniformly polynomial algorithm. The problem is that the size 
of can be 0(n^), therefore it cannot be represented explicitly. On the other 
hand, in the following we show that it is not necessary to represent the whole 
set system, most of the sets can be thrown away, it is enough to retain only a 
constant number of sets. 

We will replace y'x by a system y'* representative for S^x that has constant 
size. Representative systems and their use in finding disjoint sets were introduced 
by Monien [13] (and subsequently used also in [1]). Here we give a definition 
adapted to our problem: 

Definition 4. A subsystem SAf C ^ 3 , is representative for S^x if the following 
holds: for each regular set U C Kx x W that does not contain vertices in Wx\K*, 
if yx contains a set S disjoint from B{U), then yf also contains a set S' also 
disjoint from B{U)- We say that the subsystem yf is minimally representative 
for yx, if it is representative for yx, but it is not representative after deleting 
any of the sets from yf- 

That is, if yx can present a member avoiding all the forbidden colorings de- 
scribed by B{U), then yf can present such a member as well. For technical 
reasons, we are interested only in requirements B(U) with U as described above. 

The crucial idea is that the size of a minimally representative system can be 
bounded by a function of k independent of n (if the size of each set in yx is at 
most 2k)- This is a consequence of the following version of Bollobas’ inequality: 

Theorems (Bollobas [3]). Let (Hi, Hi), (^ 2 , 52 ), {Am,Bm) be a se- 
quence of pairs of sets over a common ground set X such that Ai n H j = 0 if 
and only if i = j- Then 
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Lemma 3. If is minimally representative for then < ( 2 ^)- 

Proof. Let = {Ai, A^t ■ ■ , Am}- Since is minimally representative for 
,5^x, therefore for every 1 < i < m, there is a regular set Bi = B{Ui) C Kx x W 
satisfying Definition 4 such that lA’x has a set disjoint from Bi, but Ai is the 
only set in disjoint from Bi (otherwise Ai could be safely removed from 
lA’f). This means that AiC\ Bi = %, and Aj C\ Bi ^ % for every z yf j. Therefore 
{Ax,Bx), (^ 2 , i? 2 ), • ■ • , {Am, Bra) Satisfy the requirements of Theorem 8, hence 
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Therefore m < ( 2 ^)) and the lemma follows. □ 

Lemma 3 shows that one can obtain a constant size representative system by 
throwing away sets until the system becomes a minimally representative. An- 
other way of obtaining a constant size system is to use the data structure of 
Monien [13] for finding and storing representative systems. Using that method, 
we can obtain a representative system of size at most This can be somewhat 
larger than given by Lemma 3, but it is also good for our purposes. 

We show that instead of determining the set system lA’x for each node, it is 
sufficient to find a set system lA’f representative for ,5^x- That is, if for each child 
z/ of a; we are given a system representative for then we can construct a 
system representative for lA’x- For a join node a;, one can find a set system 
representative for lA’x by a characterization analogous to Lemma 2: 

Lemma 4. Let x he a join node with children y and z, and let 5^* he represen- 
tative for S^y, and SAf representative for Then the system 

SAf = tySyVI Sz '■ Sy ^ SA* and Sz G lA’f form a non-hlocking pair} 

is representative for SAx . 

Proof. Since C y’y and JAf C by Lemma 2 it follows that C y’x- 
Therefore we have to show that for every regular set U not containing vertices 
from Wx \ K*, if there is a set S' € lAx disjoint from B{U), then there is a 
set S' G .A'f also disjoint from B{U). Let if be the coloring corresponding to 
set S, and let ify (resp. ipz) be the coloring of H* (resp. H*) induced by if. 
Let Sy G S^y and Sz G be the sets corresponding to ify and ifz. We have 
seen in the proof of Lemma 2 that Sy and Sz form a non-blocking pair and 
S = Sy U Sz, hence Sy is disjoint from B{U) U B{Sz) = B{U U Sz). Note that 
U does not contain vertices from Wx \ Kf, and Sz contains only vertices from 
Hf, hence U U Sz is regular, and does not contain vertices from Wy \ K*. Since 
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y* is representative for there is a set S'y G y* that is also disjoint from 
B{U U S'z). By Lemma 1, S'y n B{Sz) = 0 implies that B{S'y) n S'z = 0, hence Sz 
is disjoint from U UB{S'y) = B{UUS'y). Since y* is representative for there 
is a set S'' G y^ that is also disjoint from B{UU S'y). Applying again Lemma 1, 
we get that S'y and S' form a non-blocking pair, hence S' = S'y U S' is in S*. 
Since S' is disjoint from B{U), thus y* contains a set disjoint from B{U). □ 

If X is an add node or forget node with children y and a system y* represen- 
tative for Sy is given, then we can construct a system y* that is representative 
for y^. The construction is conceptually not difficult, but requires a tedious 
discussion. We omit the details. 

Therefore starting from the leaves, the systems y* can be constructed us- 
ing bottom up dynamic programming. After constructing y*, we use the data 
structure of Monien to reduce the size of y* to a constant. This will ensure 
that each step of the algorithm can be done in uniformly polynomial time. By 
checking whether y* is empty for the root r, we can determine whether the 
graph has a (7-coloring. This proves the main result of the section: 

Theorem 9. Coloring chordal+ke graphs is in FPT if the modulator of the 
graph is given in the input. 
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Abstract. We show that, for every finite field F, the class of all F- 
representable matroids of branch-width at most a constant t has a de- 
cidable MSO theory. In the other direction, we prove that every class 
of F-representable matroids with a decidable MSO theory must have 
uniformly bounded branch-width. 
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1 Introduction 

Monadic second order logic, which extends first order logic by allowing quan- 
tification over monadic predicates, is famous for their high expressive power in 
combination with a manageable model theory (see e.g. [12]). For this reason it 
has found many applications in different areas, as e.g. decidability, model check- 
ing, data bases, and computational complexity. 

Of special importance in this area are classes of graphs (or other structures) of 
bounded tree-width, branch-width, or clique-width, since for these classes MSO 
logic posseses besides the good model theory also very good algorithmic proper- 
ties. On the structural side, strong interest in tree-width has been motivated by 
the (now famous) Graph Minor project [19] of Robertson and Seymour which, 
besides many deep theoretical results, revolutionized the area of algorithm de- 
sign in computer science. In particular, many problems which are NP-hard for 
arbitrary structures, could be solved in polynomial and often even linear time if 
they are restricted to structures of bounded tree- width or bounded clique- width 
(see e.g. [1], or [8], [7]). 

Interestingly, general algorithmic results on efficient computability over struc- 
tures of bounded tree-width (branch-width, clique-width, etc.) come hand in 
hand with related logic results on decidability of theories. For example, for each 
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k > 0, the monadic second order theory of the class of all graphs of tree- width 
at most k, or of clique- width at most k, respectively, is decidable (see [2,6], 
or [22,23]). 

Here we shall concentrate on matroids, as a strong generalization of graphs. 
Nowadays, one can witness in the matroid community a great effort to extend 
the above mentioned Robertson-Seymour’s theoretical work on graph minors 
as far as possible to matroids, followed by important new structural results 
about representable matroids, eg. [10,11]. Inspired by those advances, we focus 
on extending the research of related complexity and logic questions from graphs 
to matroids, building on recent works [13,14] of the first author. 

Since this paper is intended for general computer-science and logic audiences, 
we provide some basic definitions concerning matroid structure, and decidability 
and interpretability of theories from mathematical logic, in the next three sec- 
tions. We bring up the MSO theory of matroids in Section 5, and present some 
related recent results there; like we show that the MSO theory of the class of all 
matroids of branch- width at most k is decidable, for every k > 0. We present our 
main result in Section 6, which extends results by the second author from [22]. 
We prove that, for every finite field F, a class of F-representable matroids with 
a decidable MSO theory must have uniformly bounded branch-width. 



2 Basics of Matroids 

We refer to Oxley [16] for matroid terminology. A matroid is a pair M = (E,B) 
where E = E{M) is the ground set of M (elements of M), and ,8 C 2^ is a 
nonempty collection of bases of M. Moreover, matroid bases satisfy the “ex- 
change axiom”; ii Bi, B 2 & B and x G B\ — B 2 , then there is y G B 2 — B\ such 
that {Bi — {a:}) U {y} G B. We consider only finite matroids. Subsets of bases are 
called independent sets, and the remaining sets are dependent. Minimal depen- 
dent sets are called circuits. All bases have the same cardinality called the rank 
r(M) of the matroid. The rank function tm{X) in M is the maximal cardinality 
of an independent subset of a set X C E{M). 

If G is a (multi)graph, then its cycle matroid on the ground set E{G) is 
denoted by M{G). The independent sets of M{G) are acyclic subsets (forests) in 
G, and the circuits of M{G) are the cycles in G. Another example of a matroid 
is a finite set of vectors with usual linear dependency. If A is a matrix, then 
the matroid formed by the column vectors of A is called the vector matroid 
of A, and denoted by M{A). The matrix A is a representation of a matroid 
M ~ M{A). We say that the matroid M{A) is W -represented if A is a matrix 
over a field F. (Fig. 1.) A graphic matroid, i.e. such that it is a cycle matroid of 
some multigraph, is representable over any field. 

The dual matroid M* of M is defined on the same ground set E, and the 
bases of M* are the set-complements of the bases of M. A set X is coindependent 
in M if it is independent in M*. An element e of M is called a loop (a coloop), if 
{e} is dependent in M (in M*). The matroid M \ e obtained by deleting a non- 
coloop element e is defined as {E — {e},B~) where B~ = {B : B G B, e ^ B}. 
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Fig. 1. An example of a vector representation of the cycle matroid M{K 4 ). The 
matroid elements are depicted by dots, and their (linear) dependency is shown 
using lines. 



The matroid Mje obtained by contracting a non-loop element e is defined using 
duality M/e = {M* \ e)* . (This corresponds to contracting an edge in a graph.) 
A minor of a matroid is obtained by a sequence of deletions and contractions of 
elements. Since these operations naturally commute, a minor M' of a matroid 
M can be uniquely expressed as M' = M\D /C where D are the coindependent 
deleted elements and C are the independent contracted elements. The following 
claim is folklore in matroid theory: 

Lemma 2.1. Let N = M \ D/C . Then a set X C E{N) is dependent in N if 
and only if there is a dependent set Y C E{M) in M such that Y — X C C. 




Fig. 2. An illustration to a 4-separation in a graph, and to a 3-separation in a 
matroid. 



Another important concept is matroid connectivity, which is close, but some- 
how different, to traditional graph connectivity. The connectivity function Am 
of a matroid M is defined for all subsets A C E hy 

Am(A) = vm{A) + rM{E — A) — r{M) + 1 . 

Here r{M) = rM{E). A subset AC E is k-separating if Xm{A) < k. A partition 
{A,E — A) is called a k-separation if A is /c-separating and both |A|, \E — A\ > k. 
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Geometrically, the spans of the two sides of a /c-separation intersect in a subspace 
of rank less than k. See in Fig. 2. In a corresponding graph view, the connectivity 
function Xg{F) of an edge subset F C E{G) equals the number of vertices of G 
incident both with F and with E{G) — F. (Then Xg{F) = Xm{G){F) provided 
both sides of the separation are connected in G.) 

3 Tree-Width and Branch-Width 

The notion of graph tree-width is well known. Let Qn denote the n x n-grid 
graph, i.e. the graph on V{Qn) = {1,2, ...,n}^ and E{Qn) = ■ 

1 < ^ II* ~ ~ / II = (Oj !}}• We say that a class S of graphs 

has bounded tree-width if there is a constant k such that any graph G € 9 has 
tree- width at most k. A basic structural result on tree- width is given in [20]: 

Theorem 3.1. (Robertson, Seymour) A graph class 9 has bounded tree-width 
if and only if there exists a constant m such that no graph G € 9 has a minor 
isomorphic to Qm- 

The same paper [20] also presents a similar, but less known, parameter called 
branch-width, and proves that branch-width is within a constant factor of tree- 
width on graphs. 

Assume that A is a symmetric function on the subsets of a ground set E. 
(Here A = Ag is the connectivity function of a graph, or A = Am of a matroid.) A 
branch decomposition of A is a pair (T, r) where T is a sub-cubic tree (A(T) < 3), 
and T is a bijection of E into the leaves of T. For e being an edge of T, the width 
of e in (T, r) equals A(A) = X{E — A), where ACE are the elements mapped 
by T to leaves of one of the two connected components of T — e. The width of 
the branch decomposition (T, r) is maximum of the widths of all edges of T, and 
branch-width of A is the minimal width over all branch decompositions of A. 

Recall the definitions of graph and matroid connectivity functions from Sec- 
tion 2. Then branch-width of A = Ag is called branch-width of a graph G, and 
that of A = Am is called branch-width of a matroid M. (See examples in Fig. 3.) 
We remark that it is possible to define matroid tree-width [15] which is within 
a constant factor of branch-width, but this is not a straightforward extension of 
traditional graph tree-width. Considering branch-width on matroids, the follow- 
ing recent result [11] is crucial for our paper: 

Theorem 3.2. (Geelen, Gerards, Whittle) For every finite field F; a class FT of 
W -representable matroids has bounded branch-width if and only if there exists a 
constant m such that no matroid N has a minor isomorphic to M{Qm)- 

4 Decidability of Theories 

We will use the following notion of a theory. Let 3C be a class of structures and 
let L be a suitable logic for X. A sentence is a set of well-formed L-formulas 
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Fig. 3. Two examples of width-3 branch decompositions of the Pappus matroid 
(top left, in rank 3) and of the binary affine cube (bottom left, in rank 4). The 
lines in matroid pictures show dependencies among elements. 



without free variables. The set of all L-sentences true in X is denoted as L- 
theory of X. We use Thi(3C) as a short notation for this theory. Hence, a theory 
can be viewed as the set of all properties, expressible in L, which all structures 
of X possess. In case that X = {G} we write Th^(G) instead of ThL(3C). Using 
this definition we obtain Thi(3C) = P|{ThL(G) : G G X}. We write Th(3C), 
ThMSo(3C) if L is first order logic, or monadic second order logic (abbreviated 
as MSO logic), respectively. 

For graphs there are actually two variants of MSO logic, commonly denoted 
by MSi and MS 2 - In MSi, set variables only denote sets of vertices. In MS 2 , 
set variables can also denote sets of edges of the considered graph. In other 
words the difference between both logics is that in MSi the domain of the graph 
consists of the vertices only and the relation is just the usual adjacency between 
vertices, while in MS 2 the domain is two-sorted and contains vertices as well as 
edges and the relation is the incidence relation. The expressive power of both 
logics was studied by Courcelle in [4]. 

A theory is said to be decidable if there is an algorithm deciding, for an 
arbitrary sentence tp G L, whether tp G Thi(3C) or not, i.e. whether (p is true 
in all structures of X. Otherwise this theory is said to be undecidahle. More 
information concerning the terminology from logic needed in this section can be 
found in classical textbooks as [9]. A good introduction into the decidability of 
theories can be found in [18] (see also [12] for monadic theories). 

To prove decidability of theories the method of model interpretability, intro- 
duced in [17] is often the best tool of choice. To describe the idea of the method 
assume that two classes of structures X and X' are given, and that L and L', 
respectively, are corresponding languages for the structures of these classes. The 
basic idea of the interpretability of theory ThL(3C) into ThL/(3C') is to transform 
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formulas of L into formulas of L' , by translating the nonlogical symbols of L by 
formulas of L' , in such a way that truth is preserved in a certain way. Here we 
assume that the logics underlying both languages are the same. Otherwise, one 
has to translate also the logical symbols. 

We explain this translation in a simple case of relational structures. First one 
chooses an L'-formula a{x) intended to define in each L'-structure G G 3C' a set 
of individuals G[a] := {a : a € dom{G) and G |= a (a)}, where dom{G) denotes 
the domain (set of individuals) of G. Then one chooses for each s-ary relational 
sign R from L an L'-formula Pr{xi, . . . , Xs), with the intended meaning to define 
a corresponding relation G[I3r\ := {(ai, . . . , Og) : ai, . . . , Og G dom{G) and G |= 
/3fl(ai, . . . , Os)}. All these formulas build the formulas of the interpretation / = 
{a{x),!3R{xi,...,Xs),...). 

With the help of these formulas one can define for each L'-structure G a 
structure G^ := (G[a], G[/3i^], . . . ), which is just the structure defined by the 
chosen formulas in G. Sometimes G^ is also denoted as /(G) and I is called an 
(L, L')-interpretation of G^ in G. In case that both L and L' are MSO languages, 
this interpretation is also denoted as MSO-interpretation. Using these formulas 
there is also a natural way to translate each L-formula Lp into an L'-formula . 
This is done by induction on the structure of formulas. The atomic formulas are 
simply substituted by the corresponding chosen formulas with the corresponding 
substituted variables. Then one may proceed via induction as follows: 

hxY ■= ixi A X 2 Y ■■= ixiY A {x 2 Y, 

(3a; x(a;))^ := 3a; (a{x) A xY^)) , 

{xeXY:=x€X, {3Xx(X)y ■■=3 XxYX). 

The resulting translation is called an interpretation with respect to L and L'. 
Its concept could be briefly illustrated with a picture: 

G L I G L' 

// G ' G G 3C' 

G^ ~H , i G 

For theories, interpretability is now defined as follows. Let X and X' be classes 
of structures and L and L' be corresponding languages. Theory Thi(3C) is said 
to be interpretable in ThL/(3C') if there is an (L, L')-interpretation / translating 
each L-formula p into an L'-formula p^ , and each L'-structure G G X' into an 
L-structure G^ as above, such that the following two conditions are satisfied: 

(i) For every structure H G X, there is a structure G G X' such that G^ = H, 

(ii) for every G G X' , the structure G^ is isomorphic to some structure of X. 

It is easy to see that interpretability is transitive. The key result for interpretabil- 
ity of theories is the following theorem [17]: 

Theorem 4.1. (Rabin) Let K. and K.' he classes of structures, and L and L' he 
suitable languages. //Thi(/C) is interpretahle in Thj;,/(/C'), then undecidahility 
o/Thi(/C) implies undecidahility of ThR>{lC). 
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5 MSO Theory of Matroids 

Considering logic point of view, a matroid M on a ground set E is the collection 
of all subsets 2® together with a unary predicate indep such that indep(F) if 
and only if C if is independent in M. (One may equivalently consider a 
matroid with a unary predicate for bases or for circuits, see in [13].) We shortly 
write MSm to say that the language of MSO logic is applied to (independence) 
matroids. If Isf is a class of independence matroids, then the MSm theory of 
is denoted by ThMSo(^)- 

To give readers a better feeling for the expressive power of MSm on a ma- 
troid, we write down a few basic matroid predicates now. 

— We write basis{B) = indep{B) A '^D(^B % DVB = D V ^ indep{D)) to 
express the fact that a basis is a maximal independent set. 

— Similarly, we write circuit(C) = ^ indep(C) AV D (^D % CVD = CVindep{D)), 
saying that C is dependent, but all proper subsets of C are independent. 

— A cocircuit is a dual circuit in a matroid (i.e. a bond in a graph). We write 
cocircuit(C) = VB [ basis(B) 3x{x € B A x € C)] A VX \_X % C V X = 
CV3B{ basis{B) AVx{x ^ BV x ^ ^))] saying that a cocircuit C intersects 
every basis, but each proper subset of C is disjoint from some basis. 

It is shown that the language of MSm is at least as powerful as that of MS 2 
on graphs. Let G l±l iL denotes the graph obtained from disjoint copies of G and 
H by adding all edges between them. The following statement is proved in [13]: 

Theorem 5.1. (Hlineny) Let G he a loopless multigraph, and let M he the cycle 
matroid of G\SK^. Then any MSO sentence (in MS 2 ) about an incidence graph 
G can he expressed as a sentence about M in MSm- 

In other words, the MSO theory of (loopless) incidence multigraphs is inter- 
pretable in a certain subclass of 3-connected graphic matroids. 

The next result we are going to mention speaks about (restricted) recogniz- 
ability of M^M-definable matroid properties via tree automata. To formulate 
this, we have to introduce briefly the concept of parse trees for representable 
matroids of bounded branch- width, which has been first defined in [13]. For a 
finite held F, an integer t > 1, and an arbitrary F-represented matroid M of 
branch- width at most t -I- 1; a t-boundaried parse tree T over F is a rooted 
ordered binary tree, whose leaves are labeled with elements of M, and the in- 
ner nodes are labeled with symbols of a certain finite alphabet (depending on 
F and t). Saying roughly, symbols of the alphabet are “small configurations” 
in the projective geometry over F. The parse tree T uniquely determines an 
F-representation (up to projective transformations) of the matroid P{T) ~ M . 
See [13] for more details and the result: 

Theorem 5.2. (Hlineny) Let W he a finite field, t>l, and let (j> he a sentence 
in the language of MSm- Then there exists a finite tree automaton Af such 
that the following is true: A t-houndaried parse tree T over F is accepted by 
Af if and only if P{T) ^ fi. Moreover, the automaton Af can be constructed 
(algorithmically) from given F, t, and (j). 
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Corollary 5.3. Let W he a finite field, t > 1, and let Tit he the class of all 
matroids representable over F of hranch-width at most t+1. Then the theory 
ThMSo(®i) is decidahle. 

Proof. Assume we are given an M^M-sentence (j). We construct the automaton 
Af from Theorem 5.2. Moreover, there is an (easily constructible [13]) automaton 
Vt accepting valid t-boundaried parse trees over F. Then “Bj ^ (j> ii and only if 
there is a parse tree accepted by Vt, but not accepted by Af . We thus, denoting 
by —Af the complement of Af, construct the cartesian product automaton A = 
{—Af) X Vt accepting the intersection of the languages of —Af and of Vt- Then 
we check for emptiness of A using standard tools of automata theory. □ 

6 Large Grids and Undecidability 

We need the following result, which was proved first in a more general form 
in [21] (see also [22]). 

Theorem 6.1. (Seese) Let X be a class of adjacency graphs such that for every 
integer k > 1 there is a graph G & X such that G has the k x k grid Qk as an 
induced subgraph. Then the MS\ theory of X is undecidahle. 

Using Theorems 3.1, 6.1 and interpretation, one concludes [22]: 

Theorem 6.2. (Seese) a) Lf a family S of planar graphs has a decidahle MS\ 
theory, then S has bounded tree-width. 

b) Lf a graph family S has a decidahle M S 2 theory, then S has bounded tree-width. 

Related results can be found also in [5] and [6]. The troubles, why part (a) of 
this theorem has to be formulated for planar graphs, lie in the fact that MS\ 
logic (unlike MS 2 ) lacks expressive power to handle minors in arbitrary graphs. 
However, that is not a problem with our MSm logic, cf. Theorem 5.1 or [14], 
and hence we can extend the (now stronger) part (b) to representable matroids 
as follows: 

Theorem 6.3. Let F &e a finite field, and let "A he a class of matroids that are 
representable by matrices over F. Lf the (monadic second-order) MSm theory 
ThMSo(ll'l) is decidahle, then the class Isf has hounded hranch-width. 

The key to the proof of this theorem is given in Theorem 3.2, which ba- 
sically states that the obstructions to small branch-width on matroids are the 
same as on graphs. Unfortunately, the seemingly straightforward way to prove 
Theorem 6.3 — via direct interpretation of graphs (Theorem 6.2) in the class 
of graphic minors of matroids in Jsf, is not so simple due to technical problems 
with (low) connectivity. That is why we give here a variant of this idea bypassing 
Theorem 6.2, and using an indirect interpretation of (graph) grids in matroid 
grid minors. 
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Remark. A restriction to F-representable matroids in Theorem 6.3 is not re- 
ally necessary; it comes more from the context of the related matroid structure 
research. According to [11], it is enough to assume that no member of 3Nf has 
a U 2 ,m- or [/|,^-minor (i.e. an m-point line or an m-point dual line) for some 
constant m. 

We begin the proof of Theorem 6.3 with an interpretation of the MSm theory 
of all minors of the class IM. To achieve this goal, we use a little technical trick 
first. Let a DC- equipped matroid be a matroid M with two distinguished unary 
predicates D and C on E{M) (with intended meaning as a pair of sets D,C C 
E{M) defining a minor N = M \ DjC). 

Lemma 6.4. Let he a class of matroids, and let J^dc denote the class of 
all DC-equipped matroids induced by members o/lM. If ThMSo(^) is decidable, 
then so is ThMSo(^Dc)- 

Proof. We may equivalently view the distinguished predicates D, C as free set 
variables in MSm- Let 4>{D,C) be an MSm formula, and N G IN. Then, by 
standard logic arguments, Ndc |= 4>{D, C) for all DC-equipped matroids Ndc 
induced by N if and only if |= \/D,C (j){D,C). Hence Afuc \= 4’{D,C) if and 
only if IN' ^ VD, C 4>{D, C). Since VD, C 4>{D, C) is an MSO formula if (j) is such, 
the statement follows. □ 



Lemma 6.5. Let be a class of matroids, and INm be the class of all minors 
of members o/lsf. Then ThMSo(^m) is interpretable in ThMSo(^Dc)- 

Proof. We again regard the distinguished predicates D,C of TIdc as free set 
variables in MSm- Let us consider a matroid Ni e Aim such that = N\Di/Ci 
for N G 'IH- We are going to use a “natural” interpretation of A^i in the DC- 
equipped matroid Njjc which results from N with a particular equipment D = 
Di, C = Cl. (Notice that both theories use the same language of MSO logic, 
and the individuals of Ni form a subset of the individuals of N.) Let ip be an 
MSm formula. The translation of ip is obtained inductively: 

— For each (bound) element variable x in ip] it is replaced with 

3x 9{x) — > 3x {x ^ C f\x ^ D f\ 6*(x)) . 

— For each (bound) set variable X in ip] it is replaced with 

3Xe{X) — > 3XWz {fz(fx\j z(fC)K{z(fX\J z(fD)h 9{X)) . 

— Every occurence of the indep predicate in ip is rewritten as (cf. Lemma 2.1) 

indep^ {X) = VT [indep{Y) V 3z{z GY f\ z ^ X f\ z ^ C)) . 
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Consider now the structure defined by indep^ in Ndc G By 

Lemma 2.1, a set X C E{N^) = E{Ni) is independent in if and only if X is 
independent in Ni, and hence is a matroid isomorphic to = TV \ D/C G 
Moreover, it is immediate from the construction of that Ni ^ r/; iff 
Xdc |= ■ Thus, / is an interpretation of ThMSo(^m) in ThMSo(^Dc)- 

Next, we define, for a matroid M, a /CC-graph of M as the graph G on the 
vertex set E{M), and edges of G connecting those pairs of elements e, / G E{M), 
such that there are a 4-element circuit C and a 4-element cocircuit C' in M 
containing both e, / G C H C". (This is not the usual way of interpretation in 
which the ground set of a matroid is formed by graph edges.) The importance 
of our definition is in that 4CC-graphs “preserve” large grids: 

Lemma 6.6. Letm > 6 be even, and M = M{Qm)- Denote by G the /CC-graph 
of M . Then G has an induced subgraph isomorphic to Qm-2- 

Proof. Recall that circuits in a cycle matroid of a graph correspond to graph 
cycles, and cocircuits to graph bonds (minimal edge cuts). The only 4-element 
cycles in a grid clearly are face-cycles in the natural planar drawing of Qm- The 
only edge cuts with at most 4 edges in Qm are formed by sets of edges incident 
with a single vertex in Qm, or by edges that are “close to the corners”. 

Let E' C E{Qm) denote the edge set of the subgraph induced on the ver- 
tices (t, j) where 1 < i,j < m. Let G' denotes the corresponding subgraph of G 
induced on E' . Choose x G E' , and assume up to symmetry x = {{i,j), {i',f)} 
where i' = i + 1 and j' = j. According to the above arguments, the only neigh- 
bours of X in G' are in the set 

E' n {{(*,i - 1 ), (*,i)}, {i,j + 1 )}, {(*',/ - 1 ), (»',/)}, {(*',/), {ej + 1 )}} • 
We now define “coordinates” for the elements x G E' as follows 

x = {{i,j),{i'J')}, <f ■ K = i + j, 4 = i + j'- 2 j. 

As one may easily check from the above description of neighbours, two elements 
x,y G E' are adjacent in G' if and only if {\kx — ky\, |4 — 41} = {Oj 1}- Hence 
the elements x G E' such that ^ + 1 < kx,ix < ^ + m — 1 induce in G' a grid 
isomorphic to Qm-2- O 

Now we are to finish a chain of interpretations from Theorem 6.1 to a proof 
of our Theorem 6.3. 

Lemma 6.7. Let M be a matroid family, and let T 4 denote the class of all 
adjacency graphs which are jCC-graphs of the members of M,. Then the MS\ 
theory 0 /T 4 is interpretable in the MSm theory ThMSo(At). 

Proof. Let us take a graph G G T 4 which is a 4CC-graph of a matroid M G M. 
Now G is regarded as an adjacency graph structure, and so the individuals (the 
domain) of G are the vertices V{G). These are to be interpreted in the ground 
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set E{M), the domain of M. Let ip be an MS\ formula. The translation ip^ in 
MSm of Ip is obtained simply by replacing every occurence of the adj predicate 
in Ip with 

adf{x,y) = 3C,C 

(ICI = \C'\ = 4 A drcuit(C) A codrcuit(C) A x,y € C A x,y € C') , 

where the matroid MSm predicates drcuit and codrcuit are defined in Section 5, 
and |Af| = 4 has an obvious interpretation in the FO logic. 

Consider the structure defined by the predicate adf on the domain E{M). 
It is ~ G by definition, for all pairs G,M as above. Moreover, adf is defined 
in the MSO logic. Hence we have got an interpretation I of ThMSOi(9^4) in 
ThMSo(iiVt). □ 

Proof of Theorem 6.3. We prove the converse direction of the implication. As- 
sume that a matroid class AT does not have bounded branch-width, and denote 
by the class of all matroids which are minors of some member of AT. By 
Theorem 3.2, for every integer m > 1, there is a matroid N G Aim isomorphic 
to the cycle matroid of the grid N ~ M{Qm). Now denote by T4 the class of all 
graphs which are 4CC-graphs of members of N^. Then, using Lemma 6.6, there 
exist members of T4 having induced subgraphs isomorphic to the grid Qk, for 
every integer A: > 1. 

Hence the class = T4 satisfies the assumptions of Theorem 6.1, and so the 
MS\ theory of T4 is undecidable. So is the MSm theory ThMSo(Nm) using the 
interpretation in Lemma 6.7, and Theorem 4.1. We analogously apply the inter- 
pretation in Lemma 6.5 to ThMSo(Nm), and conclude that also ThMSo(Ni)c) 
is undecidable, where yinc is the class of all DC-equipped matroids induced by 
N as above. Finally, Lemma 6.4 implies that the MSm theory ThMSo(N) is 
undecidable, as needed. □ 
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Abstract. We introduce a general notion of miniaturization of a prob- 
lem that comprises the different miniaturizations of concrete problems 
considered so far. We develop parts of the basic theory of miniaturiza- 
tions in this general framework. Using the appropriate logical formalism, 
we show that the miniaturization of a definable problem in W[t] lies in 
W[t], too. In particular, the miniaturization of the dominating set prob- 
lem is in W[2]. 



1 Introduction 

Parameterized complexity theory provides a framework for a refined complexity 
analysis of algorithmic problems that are intractable in general. Central to the 
theory is the notion of fixed-parameter tractahility, which relaxes the classical 
notion of tractability, polynomial time computability, by admitting algorithms 
whose runtime is exponential, but only in terms of some parameter that is usually 
expected to be small. Let FPT denote the class of all fixed-parameter tractable 
problems. A well-known example of a problem in FPT is the vertex cover prob- 
lem, the parameter being the size of the vertex cover we ask for. 

As a complexity theoretic counterpart, a theory of parameterized intractabil- 
ity has been developed. In classical complexity, the notion of NP-completeness 
is central to a nice and simple theory for intractable problems. Unfortunately, 
the world of parameterized intractability is more complex: there is a big variety 
of seemingly different classes of parameterized intractability. For a long while, 
the smallest complexity class of parameterized intractable problems considered 
in the literature was W[l], the first class of the so-called W-hierarchy. (In par- 
ticular, FPT C W[l]; and FPT 7 ^ W[l] would imply PTIME ^ NP.) 

Recently, this situation has changed: In [4], Downey et al. consider vari- 
ous problems in W[l] that, apparently, are not W[l]-hard. Most of them are 
“miniaturizations” of well-studied problems in parameterized complexity the- 
ory; for example, mini-CiRCUiT Sat is the problem that takes a circuit C of 
size < k ■ log m, where k is the parameter and m in unary is part of the input, 
and asks whether C is satisfiable. This problem is called a miniaturization of 
Circuit Sat, as the size (< k ■ logm) of C is small compared with m (under 
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the basic assumption of parameterized complexity that the parameter k is small 
too). In [4], Downey et al. introduce the class MINI[1] as the class of parame- 
terized problems fpt-reducible to mini-CiRCUiT Sat. MINI[1] now provides very 
nice connections between classical complexity and parameterized complexity as 
it is known that FPT = MINI[1] if and only if n variable 3 -Sat can be solved in 
time 2°^”b This equivalence stated in [4] is based on a result of Cai and Juedes [1]. 

Besides this “miniaturization route”, a second route to MINI[1] has been 
considered by Fellows in [7]; he calls it the “renormalization route” to MINI[1]. 
He “renormalizes” the parameterized Vertex Cover problem and considers 
the so-called k ■ logn Vertex Cover problem: It takes as input a graph G and 
as parameter a natural number k; it asks if G has vertex cover of size k ■ logn. 
This problem turns out to be MINI[l]-complete (cf. [7]). 

Before outlining the purpose and the contents of this paper let us give 
two quotations, the first one from Fellows’ paper [7] and the second one from 
Downey’s paper [3]: 

Dozens of renormalized FPT problems and miniaturized arbitrary prob- 
lems are now known to be MINI[l]-complete. However, what is known is 
quite problem specific. 

Can the hierarchy [starting with MINI[1|] be extended? 

Among others, in this paper we try to develop the theory of miniaturized prob- 
lems on a more abstract level and we address the problems mentioned in these 
quotations. Concerning the second problem, even though we introduce a hierar- 
chy of complexity classes, we conjecture, among others encouraged by the results 
of this paper, that the world of parameterized intractability in W[l] is so rich 
that, probably, there are various more or less natural hierarchies in W[l|. 

We sketch the content of the different sections. In Section 2 we give the 
necessary preliminaries. In particular, we introduce the notion of a size function, 
a polynomial time function jj jj defined on the inputs a; of a given problem 
with the property that the length |a;| of a; is polynomially bounded in ||a;||. For 
example, for a graph G = (V,E), natural choices could be jVj, the number of 
vertices, or |V| + |if|, the number of vertices and edges, or 0(|V| + |if| -log jVj), the 
total length of its binary description; but, in general, \E\ is not a size function. 

In Section 3, for a given size function jj jj, we define the concept of the 
miniaturization miniH H-Q of an arbitrary problem Q. Now, a proof essentially 
due to Cai and Juedes [1] goes through for this concept showing that miniH H- 
Q is fixed-parameter tractable just in case Qx is solvable in time In 

Proposition 2 we extend the well-known fact that a linear reduction from Q 
to Q' yields an fpt-reduction from the miniaturization of Q to that of Q' and 
essentially show that the existence of a linear reduction from Q to Q' is equivalent 
to the existence of an fpt-reduction of the miniaturization of Q to that of Q' that 
is linear with respect to the parameters. Perhaps therefore, there are so many 
not fpt-equivalent miniaturizations. 

There is a way of defining parameterized problems by means of first-order 
formulas with a free set variable X that has been dubbed Fagin- definability 
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in [8], since it is related to Fagin’s theorem characterizing NP as the class of 
S[-definable problems. For example, the parameterized clique problem is Fagin- 
definable by the formula 

VyVz((Xy A Xz f\y ^ z) ^ Eyz). 

In [5], Downey et al. showed that W[t], the tth class of the W-hierarchy, con- 
tains all parameterized problems Fagin-defined by Ili-formulas and conversely, 
there are W[t]-complete problems Fagin-defined by Ilt-formulas. In general, the 
miniaturization may increase the computational complexity of a problem; e.g., 
the parameterized vertex cover problem is fixed-parameter tractable while its 
miniaturization is not (unless MINI[1] = FPT). But the miniaturization of every 
Fagin-definable problem in W[t] lies in W[t], too. We prove this result in Section 4 
for t = I generalizing an idea used in [4] for the independent set problem. Some 
miniaturized problems considered in the literature can be regarded as miniatur- 
ization of unweighted Fagin-definable problems, a concept we introduce in this 
paper. In Section 5 we prove that in a certain sense weighted and unweighted 
definable problems have the same computational complexity. Using this result, 
we get the result on Fagin-definable problems for arbitrary t. 

As mentioned above, Ili-formulas of the form (p{X) = Vxi . . Xxt'ipiX) with 
a set variable X and with quantifier-free il^(X) are used to obtain the Fagin- 
definable problems in W[l]. We obtain a hierarchy of classes within W[l] taking 
the length t of the block of quantifiers into consideration. We study the basic 
properties of this hierarchy; in particular, we show that the (appropriate) minia- 
turization of USat is complete in the tth class of this hierarchy. Recall that 
Impagliazzo and Paturi [10] have shown that, assuming the exponential-time 
hypothesis, the complexity of USat increases with t. 

So far, when comparing the complexity of miniaturized and other parame- 
terized problems, we used many-one reductions (more precisely, FPT many-one 
reductions). In some papers, Turing reductions have been considered. As we 
show in Section 7, most problems studied in this paper are Turing equivalent. 

Finally, in Section 8 we deal with renormalizations. Besides the renormaliza- 
tion of the vertex cover problem introduced in Fellows [7] , we consider a slightly 
different renormalization and also show its fpt-equivalence to the miniaturiza- 
tion. We shall see that this result cannot be extended to arbitrary Fagin-definable 
problems, in particular not to the clique problem. 

Due to space limitations, we have to defer the proofs to the full version of 
the paper. 



2 Preliminaries 

We use standard terminology (cf. [6,8]) and therefore, only briefly recall some 
notations, definitions, and results and introduce the concept of size function. 
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2.1 First-Order Logic and Propositional Logic 

A vocabulary t is a finite set of relation symbols. A (relational) structure A of 
vocabulary t (or, simply structure), consists of a, finite set A called the universe, 
and an interpretation C A’’ of each r-ary relation symbol R G t. 

For example, let Tdrc := {E, I, O, G/y , Gy , G^}, where if is a binary relation 
symbol and I, O, G/y, Gy, G-, are unary relation symbols. We view Boolean 
circuits as Tcirc-structures 

C = {C,E^,I^,0^,G%,G^y,G^A, 

where (G, E^) is the directed acyclic graph underlying the circuit, is the set of 
all input nodes, just contains the output node, G^ , Gy , and G(; are the sets 
of and-gates, or-gates (and-gates and or-gates of arbitrary arity), and negation- 
gates, respectively. The weight of a truth value assignment to the input nodes 
of C is the number of input nodes set to true by the assignment. 

Often for graphs we shall use the more common notation G = (V, E) (or, 
G = (F(G), E{G))), where V is the set of vertices and E the set of edges. 

We define the size ||A||o of a r-structure A to be the number 

Milo ;= |A| + ^ arity(i?) • \R-^\ ■ log \A\. 

RGt 

In fact, the length of a reasonable encoding of A as a string is 0(||A||o). 

For t > 1, by lit we denote the class of all first-order formulas of the form 

Va;ii . . . Va;ifci3a:2i . . . 3x2k2 ■ ■ ■ Q^ti ■ ■ ■ Qxtkt i’l 

where Q = V if t is odd and Q = 3 otherwise, and where if is quantifier- free. St- 
formulas are defined analogously starting with a block of existential quantifiers. 

Formulas of propositional logic are built up from propositional variables 
Xi,X 2 , ... by taking conjunctions, disjunctions, and negations. We distinguish 
between small conjunctions, denoted by A, which are just conjunctions of two for- 
mulas, and big conjunctions, denoted by f\, which are conjunctions of arbitrary 
finite sets of formulas. Analogously, we distinguish between small disjunctions, 
denoted by V, and big disjunctions, denoted by \f . 

For t > 0 and d > 1 we define the sets Ft^d and At^d of propositional formulas 
by induction on t (here, by (Ai A ... A A^) we mean the iterated small conjunction 

((...((Ai AA2)AA3)...)AAA: 

ho.d := {(Ai A ... A Xr) I Ai, . . . , Ar literals and r < d}, 

Ao,d := {(Ai V ... V Xr) I Ai, . . . , Ar literals and r < d}, 

^t+i,d ■= {An I n c At^d}, 

AM,d ■■= (Vn|ncFi,4. 

Let CNF denote the class of all propositional formulas in conjunctive normal 
form. A formula is in d-CNF, if it is a conjunction of disjunctions of at most 
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d literals. Often, we identify a formula a G d-CNF with a r^-structure A(a). 
Here Td '■= {N, C}, where N is binary and C is d-ary, and A(a) has the set 
{X, ~^X I X variable of a} as universe and 

{(X, I X variable of a}; 

(jA(a) ^ I (Ai V ... V Xd) is a clause of a}, 

w.l.o.g. we assume each clause has exactly d literals. A propositional formula a 
is k-satisfiable, if there is an assignment satisfying a of weight k (i.e., setting 
exactly k variables of a to true). 



2.2 Size Functions 

Let S be an alphabet. For a string a: G S*, we denote its length by |a;|. 

Definition 1. A function || || : S* ^ N zs a size function, if it is computable in 
polynomial time and if, for some c G N, 

N<lkr 



holds for all a: G S*. 

In particular, | | is a size function. The function || ||o (cf. Section 2.1) defined 
for r-structures (more precisely, for the encodings of r-structures by strings) is 
a size function. We introduce further size functions for r-structures: 

MII+ := \A\ + ^ arity(i?) • \R-^\, 

RGt 

M||-:= |A|. 



Note that 

— for a graph G with n vertices and m edges: ||G||- = n and ||G||+ = 0(n+m); 

— for a circuit C with n gates and m lines: ||C||_ = n and ||C||+ = 0(n + m); 

~ for a propositional formula a G d-CNF with n variables and m clauses: 

||a||_ (:= ||Aa)||-) = 0(n) and ||a||+ (:= ||A(a)||+) =Q{n + m)A 



2.3 Parameterized Complexity 

A parameterized problem is a set Q C S* x H*, where S and H are finite alpha- 
bets. If (x,y) G S* X n* is an instance of a parameterized problem, we refer to 
X as the input and to y as the parameter. 

^ Note that for arbitrary propositional formulas the number of variables does not 
define a size function; for formulas a in d-CNF we obtain a size function, since we 
identify a with Afo). Equivalently, we could restrict ourselves to formulas a that 
contain every clause at most once. 
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A parameterized problem Q C S* x II* is fixed-parameter tractable, if there 
are a computable function / : N ^ N, a polynomial p, and an algorithm that, 
given a pair (x,y) G S* x II*, decides if (x,y) G Q in at most f (\y\) ■ p{\x\) steps. 

FPT denotes the complexity class consisting of all fixed-parameter tractable 
parameterized problems. 

Of course, we may view any parameterized problem Q C S* x II* as a classical 
problem, say, in the alphabet obtained from S U II by adding new symbols ‘(’, 

5 5 ) • 

We mainly consider parameterized problems Q ^ x N; so we give most 
definitions and state most results only for this case. 

To illustrate our notation, let us give two examples of parameterized prob- 
lems, the weighted satisfiability problem WSat( 0) for a class 0 of propositional 
formulas and the parameterized model-checking problem p-MC(Si[r]) for Si- 
sentences of vocabulary r: 



WSat(0) 




p-MC(Si[r]) 


Input: a in 0. 




Input: A T-structure A. 


Parameter: fc G N. 




Parameter: ip G Si[r]. 


Problem: Decide if a is k- 




Problem: Decide if A satis- 


satisfiable. 




fies p. 



For computable functions /, (/ : N ^ N, we write / G o®®((/), if / G o(g) holds 
in an effective way, i.e., if there is a computable function h such that, given any 
t' G N with £ > 0, we have f(m)/g{m) < l/£ for all m > h{£). 

Essentially the idea underlying the proof of Proposition 2 can be used to 
show that the effective versions of two notions of subexponential time, namely 
DTIME(2°<”)) and f|e>o DTIME(2<= ") coincide: 

Proposition 1. For a classical problem Q C S* the following are equivalent. 

1. Q £ DTIME(2°°*1”^), i.e., Q G DTIME(2°*®^) for some function g G o®*^^(id), 
where id denotes the identity function on N. 

2. For every rational e > 0, there is an algorithm deciding Q in time 0(2'^ ”). 
Moreover, Ag can be computed from e. 

Lemma 1 (Cai and Juedes [1]). Let Q C S* x N &e a parameterized problem. 
Assume that there is an algorithm solving Qxk in time kl) some 

computable function h G o®®(id). Then Qxk is solvable in time g(/c) + 0(|a;|) for 
some computable function g. In particular, Q G FPT. 

Complementing the notion of fixed-parameter tractability, there is a theory 
of parameterized intractability. It is based on the notion of fpt-reduction (the 
natural notion of many-one reduction of parameterized complexity theory, see 
[ 6 , 8 ]). 

We write Q Q', if there is an fpt-reduction from Q to Q', and Q Q' 
if (g <fp‘ Q' and Q' <fp‘ Q). We set 

:= {Q' I Q' Q} and [C]fP‘ := |J [QJ^p* 

Qec 
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for a class C of parameterized problems. For f > 1, the class W[t] is defined by 

W[t] := [{WSAT(Fi,d) I dGN}]^P*. 

Clearly, FPT C W[l] C W[2] . . . and it is conjectured that FPT 7 ^ W[l] (which 
would imply PTIME ^ NP). It is well-known that 

W[l] = [{p-MC(Si[r]) I T a vocabulary}]^’’*. 

3 The Miniaturization of an Arbitrary Problem 

In this section, for a classical problem Q and a size function || ||, we introduce its 
miniaturization mini^^ ^^~Q, a parameterized problem, and study the relationship 
between the complexity of Q and mini^^ ^^-Q. 

Definition 2. Let Q C S* and let || || : S* ^ N &e a size function. The pa- 
rameterized miniaturization mini^^ with respect to || || is the parameterized 
problem: 

minimi '^^-Q 

Input: n, fc G N in unary, and a; G S* . 

Parameter: k. 

Problem: Decide if ||a;|| < k ■ logn and x G Q. 



Remark 1. a) Let Q C S* and || || be a size function with |a;| < ||a;||^. The 
condition ||a;|| < k ■ logn can be checked in time polynomial in k and n only. 
Therefore, often the problem mini^^ is presented in the more appealing form: 

minimi 

Input: n, fc G N in unary, a; G S* with ||a:|| < k ■ logn. 

Parameter: k. 

Problem: Decide if a; G Q. 

b) Arguing similarly as in part a), one shows that if minimi is in FPT, then 
there is an algorithm solving minimi (on instance n, k, x) in < f{k) -pin) steps 
for some computable function / and some polynomial p. 

c) If Q G PTIME, then miniH H-Q G FPT. 

The following result relates the fixed-parameter tractability of mini^^ with 
the solvability of Q in subexponential time. Its proof uses an idea of [I] in the 
form presented in [4]. 

Proposition 2. For Q C S* and any size function || || : S* ^ N the following 
are equivalent: 

1. Q is solvable in time i.e., in time for some h G o®®(id). 

2 . minill H-Q G FPT. 
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3 . There is an algorithm that, for every instanee n,k,x 0 / minimi with 

||a;|| < k ■ logn decides if (n, k, x) € minimi in time f{k) + 0 (n) for some 
computable f. 

The implication from (1) to (2) in the following proposition corresponds to 
the well-known fact that a polynomial time “linear size” reduction between two 
problems yields an fpt-reduction of their miniaturizations: 

Proposition 3 . Let Qi C S* and Q2 C and let || ||i : S* ^ N &e a size 
function for i = 1,2. Then, the following are equivalent: 

1 . There is a function / : Sj ^ S| computable in time 2 ° such that for 

all X € S*, ||/(a ;)||2 G 0(||a:||i) and such that f is a reduction of Q\ to Q2, 
i.e., 

X &Q\ <1=^ f(x) G Q2- 

2. There is an fpt-reduction R from minimi to minimi such that for 

any instance (ni,ki,xi) 0 / minimi with ||a;i||i < k\ ■ logni we have 

R{ni,ki,xi) = (ri2, ^2,2:2) with = O(fci) and \\x2W2 < • logn2. 

Remark 2. a) Take as Q\ a language in \ DTIME(2°*^I“^I^) and 

as Q2 a language in DTIME(2° complete for EXPTIME under polynomial 
time reductions. In particular, there is a polynomial time reduction from Q\ to 
Q 2 - By Proposition 3, mini^ l-Qi ^ FPT and mini^ I-Q 2 G FPT. Hence, there 
is no fpt-reduction from mini^ ^-Q\ to mini^ ^-Q2- This example shows that the 
condition “||/(a :)||2 G 0(||a:||i)” in (1) of the preceding proposition cannot be 
weakened to “||/(a ;)||2 < g(||a;||i) for some polynomial g”. 

b) For a natural number d > 1 replace the condition k2 G 0 {k\) in (2) of 
Proposition 3 by ^2 G 0 {kf). Then, along the lines of the preceding proof, one 
can show that there is a reduction / from Q\ to Q2 according to (1) satisfying 
\\f{x)\\2&0{\\x\\f) 

We close this section with some examples. Let CircSat, Sat, and t-SAT 
denote the satisfiability problem for circuits, for propositional formulas in CNF, 
and for propositional formulas in t-CNF, respectively. In Section 2.1, for a circuit 
C, we defined ||C||o, ||C||+, and ||C||_. Essentially they are the (total) size of an 
encoding of C, the number of gates + the number of lines of C, and the number of 
gates of C, respectively. In the following, we abbreviate miniH H+-CircSat and 
minill H--CircSat by minP-CiRCSAT and mini“-CiRCSAT, respectively. The 
same notations are used for other problems. Taking as Q in Proposition 2 the 
problem CircSat, we get the following result (cf. [1,4]); it shows, for example, 
that minP-CiRCSAT G FPT is quite unlike. 

Proposition 4 . 1 . For II II G III II+, II II-}.' minimi II-CircSat G FPT if and 

only if there is a subexponential algorithm for CircSat, i.e., if there is an 
algorithm with running time 2 ° checking if the circuit C is satisfiable. 

2. mini II H‘’-CircSat G FPT. 
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By Proposition 3, the well-known linear reductions between CircSat, Sat, 
and 3 - Sat yield: 

— mini'^-CiRCSAT minP-SAT minP-3-SAT. 

Denote by VC, IS, and Clique the vertex cover problem, the independent 
set problem, and the clique problem, respectively; e.g., the instances to Clique 
consist of pairs (G, r), where G = {V, E) is a graph and r is a natural number 
with r < \V\. (G,r) G Clique if and only if there is a clique of size r in G. 
We let ||(G, r)||_ = ||G||_ and ||(G, r)||+ = ||G||+ and use the analogous notations 
for VC and IS. Clearly, the mini’*’ versions of these problems are fpt-reducible to 
their mini“ versions. Using this fact and well-known linear reductions between 
the corresponding problems, we get the following fpt-reductions between their 
miniaturized versions: 

— mini'^-3-SAT <^p* minP -VC =^p* mini'^-IS <^p* mini“-IS =^p‘ mini“-VC =^p* 
mini“-CLiQUE. 



4 The Miniaturization of Fagin-Definable Problems 

It is well-known that the parameterized halting problem HP for Turing machines, 
parameterized by the number of steps, is in W[l], even it is W[l]-complete. As 
pointed out in Downey [3], it is unknown whether minil l-HP G W[l]; we conjec- 
ture that this is not the case. In this section we show that the miniaturizations 
of Fagin-definable problems in W[l] are themselves in W[l]. 

We start by recalling the definition of Fagin-definable problem. Let t be a 
vocabulary and C a class of r-structures decidable in polynomial time. Let (f(X) 
be a first-order formula of vocabulary r that contains a free set variable X (that 
is, A is a unary second-order variable); it defines a problem W^p^x) on C: 

Input: A r-structure A in C. 

Parameter: r G N with r < \A\. 

Problem: Is there a subset S' of A of cardinality r satisfying ip{X) 
in A, i.e., with A |= 

We say that (p{X) Fagin-defines Wp,(X)(C) on C and that a parameterized prob- 
lem Q C C is Fagin-definable, if Q = IUc^(X)(C) for some (piX). 

For example, the parameterized vertex cover problem, the independent set 
problem, and the dominating set problem are Fagin-defined on the class Graph 
of all graphs by \/y'iz[Eyz (Xy V Xz)), by 'dy'izifiXy A Xz) ^Eyz), and 
by A{y = zW Eyzfij, respectively. 

If C is the class of all r-structures, we denote Wp,(X)(C) by Wp^(X)- For nota- 
tional simplicity, we formulate most results for Fagin-definable problems IU<p(jc). 
Their extensions to Fagin-definable problems Wp,(X){G) are easy. 

So far, we defined the miniaturization for classical problems only. Here and 
later, when speaking of the miniaturization of a parameterized problem Q, we 
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consider Q as a classical problem as explained in Section 2.3. Again for a struc- 
ture A and r G N with r < |A|, we set ||(A, r)|j_ = |1A||_ and ||(A, r)||+ = |]A|j+. 
For example for a formula ip(X) of vocabulary r, mim~ -Wtp(x) is the problem 

Input: n, A: G N in unary, a r-structure A with |A| < fc • logn, 
and r G N with r < \A\. 

Parameter: k. 

Problem: Is there a subset S' of A of cardinality r with A |= 7 >(S)? 

It was shown in [5] (see [8]) that W[t] = [{VF,, 3 (jC) | ^ Hi-formula}] 

The following theorem shows that the miniaturization with respect to || ||_ 
(and hence, its miniaturization with respect to || ||+) of a Fagin-definable prob- 
lem in W[l], lies in W[l], too. In [4], it was shown that mini“-IS is reducible to 
the parameterized independent set problem, a problem in W[l]. There, a Tur- 
ing reduction was exhibited. We generalize the idea underlying this reduction 
appropriately to obtain our result. 

Theorem 1. Let he a Ai-formula. Then, u\\ni~ -W,p(X) G W[l]. 

Above we gave definitions of VC and IS by means of Ili-formulas, hence: 
Corollary 1. mini” -VC, mini” -IS G W[l]. 

5 The Miniaturization of Unweighted Definable Problems 

In general, in parameterized complexity we consider weighted satisfiability prob- 
lems; often, in the context of miniaturizations they are considered in the un- 
weighted form. In this section, for Fagin-definable problems we show that the 
miniaturized weighted and unweighted problems have the same computational 
complexity. Using this result, we prove, for every t > I, that the miniaturization 
of a Fagin-definable problem in W[t] lies in W[t], too. 

Definition 3. Let t he a vocabulary. Let tp(A) be a first-order formula of vo- 
cabulary T with the free set variable X; it defines a classical problem U,p(X) given 
by: 

Uip(X) 

Input: A r-structure A. 

Problem: Is there a subset S' of A with A \= 7 ?(S)? 



Theorem 2. For q>3 and t > 1.- 

1. [{minU-{7c^(X) | G = [{tAyiT -W,^(X) \ G Ily,}]^’^*. 

2. [{mini”-C/,,,(jf) | ip{X) G Ili.q}]^'’* = [{mini”-IU<^(X) | G IIy,}]^'^‘. 

3. [{minU-U^(X) | AX) G = [{minT-W^(X) \ t(X) G 

4 . [{minr -U,p(X) \ t(X) G = [{mini”-lU,(jf) | AX) G IIJ]^'’*. 
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By Theorem 1 this implies: 

Corollary 2. If (p(X) G Hi then mini” mini'*' G W[l]. 

Theorem 3. Let t > 1 and ip{X) G lit. Then mini'*'-ITc^(X), mini” -lT<p(jf) G 
W[t]. 

Corollary 3. mini”-DOMiNATiNG Set G W[2]. 

Recall that the parameterized dominating set problem (p-DS) is complete for 
W[2], so Corollary 3 implies that if p-DS G FPT, then the dominating set problem 
can be decided in time 2° Moreover by a refined argument, similar to that 
needed to prove Theorem 3, one can show the following result analogous to a 
result concerning the clique problem proved in [2]. 

Theorem 4. Ifp-DS can he decided in time f(k) ■ n° for a computable func- 
tion f , then both the dominating set problem and the clique problem are solvable 
in time 

This improves a result stated in [7]. 

6 The M“-hierarchy 

The previous analysis suggests the definition of a hierarchy within W[l], taking 
the number of universal quantifiers into account: For t > 1, we introduce the 
class M”[t] by 

M”[t] := [{mini”-C/,p(jf) | ip{X) G 
By Corollary 2, M”[t] C W[l]; therefore, 

M”[l] C M”[2] C ... C M”[t] C ...W[l]. (1) 

By Theorem 2, for t > 3, 

M”[t] = [{mini”-IT^(X) | ip(X) G 



Theorem 5. For all t >2, mini T-Sat is complete in M [t]. 

It is well-known that 2 -Sat is in PTIME, thus mini”-2-SAT G FPT by part c) of 
Remark 1. Hence, (M”[l] =)M”[2] = FPT by the preceding theorem. 

Is the hierarchy in (I) (starting with t = 2) strict? By the preceding theorem, 
we know that mini”-t-SAT is M” [t]-complete; hence, in connection with this 
problem one should mention the result of Impagliazzo and Paturi [10] that, 
assuming the exponential-time hypothesis, the complexity of CSat increases 
with t. 
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7 MINI[1] and Turing Reductions 

In [4], the class MINI[1] (sometimes denoted by M[l]) was introduced: 

MINI[1] := [mini'^-CiRCSAT]^P‘. 

Since mini'*‘-CiRCSAT mini’^-3-SAT mini -3 -Sat, we know by Theo- 
rem 5 that MINI[1] C M“[3]. 

Sometimes (e.g., in [4,7]), in connection with the class MINI[1], Turing reduc- 
tions (more precisely, parameterized Turing reductions) have been considered; 
the fpt-reductions considered so far in this paper were many-one reductions. 

From the point of view of Turing reductions nearly all problems considered 
here have the same complexity. In fact, it has been implicitly shown in Im- 
pagliazzo, Paturi and Zane [11] that for t > 3, there is a parameterized Turing 
reduction from mini“-t-SAT to mini'''-t-SAT, hence, these two problems are Tur- 
ing equivalent. In particular, if we denote by MINIt[1] and M^[t] the closure 
under parameterized Turing reductions of MINI[1] and M“[t], respectively, we 
have MINIt[1] = M^[3], but also = M^[3] for all t > 3. 

8 Renormalizations 

In the context of miniaturized problems two “renormalizations” of VC have been 
considered, (fcdog n)‘^-VC and (fcdog n)“-VC. Let || || be an arbitrary size function 
on the class of graphs. Define (k ■ logn)H H-VC by 

(/c-logn)ll ll-VC 

Input: G = (V, E). 

Parameter: /c G N. 

Problem: Does G have a vertex cover of size k ■ log ||G||? 

Clearly, (k ■ logn)“-VC and (k ■ logn)'^-VC denote (k ■ logn)H H--VC and {k ■ 
logn)ll II+-VC, respectively. We show that both problems, (fc • logn)“-VC and 
(k ■ logn)'*'-VC, are fpt-equivalent to mini“-VC. The equivalence of the first and 
the third problem is stated in [7]. There, no proof is given and hence, we do not 
know if [7] refers to Turing reductions or to many-one reductions. 

Theorem 6. (fc • logn)“-VC, (/c • logn)'^-VC, and mini“-VC are ipi- equivalent. 

Remark 3. In the same way, one could define the k ■ log n renormalizations of 
various parameterized problems, for example, of all Fagin-definable ones. Since 
we have no substantial results for the general case, we introduced the notion 
of renormalization for the vertex cover problem directly. In fact, the preceding 
theorem does not generalize to Clique: 

Claim: If FPT ^ W[l], then {k ■ logn)“-CLiQUE mini“-CLiQUE. 

For the reader familiar with the terminology of [9], we state the following gen- 
eralization of this claim: 
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For i > 2, if W[l] ^ FPT, then (fc • logn) W^p(X) mini W^p(X) for 
every generic Ilt/i-formula ^{X). 

In particular, for the dominating set problem, we have 

If W[2] yPPT, then {k ■ logn)“-DS mini“-DS. 

9 Conclusions 

We have introduced a general notion of miniaturization of a problem that com- 
prises the different miniaturizations of concrete problems considered so far. Using 
the appropriate logical formalism, we were able to show that the miniaturizations 
of definable problems in W[t] are in W[t], too. Based on this logical formalism 
we introduced a hierarchy of complexity classes in W[l]. 
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Abstract. We demonstrate a kernel of size O(k^) for 3-Hitting Set 
(Hitting Set when all subsets in the collection to be hit are of size 
at most three), giving a partial answer to an open question of Nieder- 
meier by improving on the 0{k^) kernel of Niedermeier and Rossmanith. 
Our technique uses the Nemhauser- Trotter linear-size kernel for Vertex 
Cover, and generalizes to demonstrating a kernel of size for 

r-HiTTiNG Set (for fixed r). 



Keywords: hitting set, fixed parameter algorithms, kernels 

1 Introduction 

Kernelization is a central technique in the development of fixed-parameter tra- 
ctable algorithms. Intuitively, a kernelization for a parameterized problem V is 
a polynomial-time algorithm that, given any instance X, either determines that 
it is a “no” instance, or finds another instance 2' (with perhaps a modified 
parameter value) such that X' is a “yes” instance if and only if X is, and the 
size of X' (which is the kernel) is bounded by some function of k. Since many 
problems have exponential-time brute-force algorithms, applying one to X' yields 
a fixed-parameter tractable algorithm for V. Although technically any problem 
admitting a fixed-parameter tractable algorithm has a kernel, it is more useful to 
discover kernelization techniques as an aid to finding fixed-parameter tractable 
algorithms than to extract kernels from such algorithms. 

This paper describes an algorithm for finding kernels for variations of the 
Hitting Set problem, one of the original NP-complete problems from Karp’s 
1972 paper [6]. In order to describe these variations, we need some terminology. 
Given two sets S and T, we say S hits T (or T is hit S') if S' n T yf 0. We 
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apologize for the violent metaphor, which we inherited; our preference would be 
to use touches or meets. 

r-HiTTiNG Set 

Input: A collection C of subsets of a finite set S such that for every T € C, 
\T\ <r. 

Parameter: A non-negative integer k. 

Question: Is there a subset H of S such that \H\ < k and for every T € C, H 
hits T? 

We describe an instance of r-HiTTiNG Set by a tuple X = {S,C, k). For any 
r >2, r-HiTTiNG Set is NP-complete, since for r = 2 it is simply the Vertex 
Cover problem {S is the set of vertices and C the set of edges). 

Because of the set-theoretic nature of r-HiTTiNG Set, kernels for it have 
a slightly stronger property than described in the general introduction above. 
Given a collection C C 2‘® and a subset S' of S, we denote by C|s/ the collection 
of sets in C restricted to S' , that is, the set {Tn S" | T G C, Tn S" yf 0}. A kernel 
for an instance of r-HiTTiNG Set is a subset K of S such that any solution to 
the instance X' = {K,C\k, k) of r-HiTTiNG Set is a solution to X. 

If we can find a kernel of size /(fc) for r-HiTTiNG Set, we can solve any 
instance in time 0{f{k)^n) by simply trying all subsets of the kernel to see if they 
are solutions to the restricted problem. As we mentioned above, this approach 
works for other kernelizable problems, though in the case of r-HiTTiNG Set an 
approach based on bounded search trees is faster [9], and kernelization serves 
mainly to bring the running time of the search algorithm down from O(c^n) to 
0{c^ f{k) + n) (for some constant c). The ultimate goal is to find a linear-sized 
kernel for problems, because this brings down the cost of the brute-force search, 
and implies the existence of a constant-factor approximation algorithm. 

2 Reducing Kernel Size 

The first instance of kernelization most students of parameterized complexity [5] 
see is the O(fc^) kernel for Vertex Cover due to S. Buss [3, cited in [2]]. 
But Vertex Cover is one of the few problems known to have a linear-sized 
kernel, provided by the following theorem due to Nemhauser and Trotter [7] 
(with improved running time due to Chen et al. [4]). 

Theorem 1. There is an algorithm running in time 0{kn + k^) that, for any 
instance X = (S,C,k) o/ V ertex Cover with input size n, either computes a 
kernel K C S, \K\ < 2k, or proves that X is a “no” instance. 

The proof of Theorem 1 makes elegant use of the linear programming relax- 
ation of the integer program for Vertex Cover, but we do not need to know 
anything about the proof; we will use the algorithm as a black-box subroutine 
in computing our kernel for r-HiTTiNG Set. To illustrate the method, we first 
demonstrate a kernel of size 6/c^ for 3-Hitting Set. Theorem 2 below improves 
the 0{k^) kernel for 3-Hitting Set given by Niedermayer and Rossmanith [9], 
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which is basically a generalization of Buss’s 0{k^) kernel for Vertex Cover. 
We save a factor of k by using the fact that Nemhauser and Trotter give a kernel 
of size 0{k), not 0{k^), for r-HiTTiNG Set in the case r = 2, though an entirely 
different technique is employed to make use of this fact. Niedermeier [8, p. 31] 
lists as an open problem the use of Nemhauser-Trotter techniques to improve 
the size of the kernel for r-HiTTiNG Set. 

Theorem 2. There is an algorithm running in time 0{kn + that, for any 
instanee X = (S,C,k) o/ 3 -Hitting Set with input size n, either computes a 
kernel K C S, \K\ < 6k^, or proves that I is a “no” instance. 

Proof. Our algorithm actually does something slightly stronger; it computes a 
set F C S' of elements that must be in any solution to X, and a set M C S of 
elements that may be in the solution, with |F U M\ < 6k^. (Many kernelization 
algorithms do this, including that of Theorem 1, though we do not use this fact.) 

We start by forming a collection Q C using the following greedy algorithm: 
Start with Q empty, and repeatedly choose an arbitrary set C G C. Add C to G, 
and delete from C any set with a nonempty intersection with C. Repeat until C 
is empty. This takes 0{n) time. 

If jtjj contains more than k sets, I is a “no” instance, because any two sets 
in G are disjoint, and so more than k elements are required to hit them all. 
If jtjj < k, we proceed to construct the kernel. Let E be the set of elements 
appearing in the sets in G, that is, E = IJceG ^ must hit every set in C (if it 
did not hit some set, the set would not have been deleted in the algorithm that 
created G, which is a contradiction), and |F| < 3k. Thus F is a hitting set, but 
if a hitting set of size k exists, it may not be contained within E. We will use 
the elements of E to construct our kernel. 

For each e G E, we define Ce to be the collection of sets in C containing the 
element e, that is,Ce = {T|eGT, T gC}. Think of Ce as a subproblem induced 
by the element e. We also define C' to be the sets in Ce but with the element e 
removed, that is, C' = {T \ {e} | T G Ce}. Note that Xe = {S \ {e},C', k) is an 
instance of Vertex Cover, since every set in C' has size at most two. Since 
every T G Ce must be hit, we will either choose e to be in the hitting set, or we 
find a hitting set for Xe. The former can be achieved by adding e to E , and the 
latter by adding to M a kernel for the instance Xe of Vertex Cover. Doing 
this for every element e G E will give us the sets F and M that we seek. 

Applying Theorem 1, either Xe is a “no” instance, or we can find a kernel Ke 
of size at most 2k for Xe in time 0{kue + k^) (where Ue is the size of the instance 
Xe). If Xe is a “no” instance, e must be in any solution H for X. Suppose it is 
not. Then in order for E[ to hit all the sets in Ce, El would have to be a solution 
to Xe. But since Xe is a “no” instance, this would make the size of El greater 
than k, which is a contradiction to H being a solution for X. Thus if Xg is a “no” 
instance, we add e to F. If instead it is a “yes” instance, we add Kg to M. 

Since, for each of the at most 3k elements of E, we either added one element 
to F or at most 2k elements to M, |F U M| < 6fc^, as required. 

We claim that the set K = FU M is a kernel for X. To see this, suppose that 
X' is the problem defined by the candidate kernel K, that is, X' = (K,C\K,k). 
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If X' is a “no” instance, then X must be as well, since if i/ is a solution for I, 
H O K will be a solution for X' . If X' is a “yes” instance, then let H' C K he a 
solution. Since every element added to F must be in any solution, we know that 
F C H'. We need to show that H' is a solution for X. To do this, we take an 
arbitrary set T in C, and show that H' hits it. 

If F hits T, we are done, so suppose it does not. Since E hits T, there must 
be some element e in if n T; if e is in the hitting set FI' , then T is hit by H', 
since it is hit by e. If e is not in H' , then since F does not hit T, e is not in F. 
It follows that Xe is a “yes” instance, and Kg C M . 

Since is a kernel for Ig, Ke must hit T \ {e} (because Ke contains a 
solution for Ig, which is a “yes” instance). Since T H is nonempty, it is a set 
in C\k- Since F[' is a hitting set for C|/c, it must hit TfiKe, and therefore it hits 
T, as required. 

The running time of the procedure is 0(n) to find E plus 0{kne + k^) 

to find F and M , where rig is the number of sets in C that contain e. Since every 
set in C contains at most three elements, X^g < 3n, and since \E\ < 3k, the 
total running time is 0{kn + fc^). □ 

In proving the above theorem, we used a subroutine for kernelizing Vertex 
Cover to create an algorithm for kernelizing 3 -Hitting Set. We can continue 
this process, using a subroutine for kernelizing (r — 1 )-Hitting Set to create 
an algorithm for kernelizing t-Hitting Set. 

Theorem 3. For fixed r >2, if there is an algorithm running in time 0{kn+k'') 
that finds an 0{k''~^)-size kernel for “yes” instances of {r — 1)-Hitting Set, 
then there is an algorithm running in time 0{kn + that finds an 0{k”~^)- 

size kernel for “yes” instances o/r-HiTTiNG Set. 

Proof. The proof of this theorem is a generalization of the proof of Theorem 2, 
which we can describe more succinctly now. Let X = {S,C, k) be an instance of 
r-HiTTiNG Set. As before, we form a set F of elements that must be in any 
solution to X, and a set M of elements that might be, with F U M being our 
kernel of the desired size. Let Q he a maximal pairwise disjoint collection of sets 
from C chosen using a greedy algorithm running in time 0(n). If |^| > k, there 
is no solution to X. If |^| < k, we let E = 1Jcg6 ^ ^ must hit every set in C, 
and \E\ < rk. 

For each e G E, we define Cg = {T \ e G T,T G C}, and C' = {T \ {e} | T G 
Cej. Then Jg = (S \ {e},C',fc) is an instance of (r — 1 )-Hitting Set. By the 
statement of the theorem, either Xg is a “no” instance, or we can find a kernel 
Kg of size 0{k”~^) for Xg, where rig is the size of Xg. In the former case, e must 
be in any solution FI for X, because if it is not, FI would have to be a solution to 
Xg in order for it to hit all the sets in Cg; thus we can add e to F’. In the latter 
case, we add Kg to M. Since we are adding at most rk sets of size 0{k'~~"^) to 
either F or M, \F U M\ = 0{k”~^). 

Then K = FU M is a kernel for X. To see this, we define X' = {K,C\k, k). If 
X' is a “no” instance, so is X. If X' is a “yes” instance, let FI' be a solution for it; 
F C FI' . If T is any set in C, we must show that FI' hits T. Either F hits T, or. 
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since E hits T, there exists e G E OT; if e G H', then T is hit by EE . If e ^ H', 
Xe is a “yes” instance. Since Ke is a kernel for Xf., it hits T \ {e}, so T n Ke is a 
set in C I /c- Since H' is a hitting set iov C\k, it must hit T n KTe and thus T. 

The running time of the procedure is 0{n) + '^f^O{kne + fc”); since every 
set contains at most r elements, rze < rn, and the total running time is 
0(fcn + □ 

The method of Niedermeier and Rossmanith also generalizes easily to provide 
a kernel of size 0(fc”) for t-Hitting Set (though this is not explicitly mentioned 
in their paper). Unfortunately, the constants hidden in the O-notation increase 
with r in our case, but not for the Niedermeier-Rossmanith kernel. Thus The- 
orem 3 is not of much practical interest for larger values of r; even in the case 
r = 3, the Niedermeier-Rossmanith kernel is smaller for fc < 6. Niedermeier [8, 
p. 34] mentions that Nemhauser-Trotter kernelization seems to perform well 
in practice, and this suggests that our kernelization for 3-Hitting Set may 
also have practical merit. However, recent empirical work [1] describes instances 
where LP-based methods such as Nemhauser-Trotter do poorly. Clearly, much 
more investigation remains to be done. 

3 Conclusion 

Since Hitting Set is W[2]-complete when the size of sets in the collection is not 
bounded in size [5], it is unlikely that we will find a linear-sized kernel for this 
problem. However, the statement “r-HiTTiNG Set has a kernel of size /(r)A:” 
for some /(r) = is not inconsistent with what we believe about separations 
among parameterized complexity classes. It would be interesting to either prove 
this statement, or to demonstrate it false by proving lower bounds on the size of 
kernels for this and other FPT problems. 
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Abstract. The problem of packing k edge-disjoint triangles in a graph 
has been thoroughly studied both in the classical complexity and the 
approximation fields and it has a wide range of applications in many 
areas, especially computational biology [BP96]. In this paper we present 
an analysis of the problem from a parameterized complexity viewpoint. 
We describe a fixed-parameter tractable algorithm for the problem by 
means of kernelization and crown rule reductions, two of the newest 
techniques for fixed-parameter algorithm design. We achieve a kernel 
size bounded by 4fc, where k is the number of triangles in the packing. 



1 Introduction 

The problem of finding the maximum number of vertex or edge disjoint cycles in 
an undirected graph G has applications in many different fields, for instance in 
computational biology [BP96]. The problem is defined as follows: Let G = {V, E) 
be a simple graph. A triangle T in G is any induced subgraph of G having 
precisely 3 edges and 3 vertices. A graph G = {V, E) is said to have a k packing 
of triangles if there exist k disjoint copies Ti, ...,Tfe of T in the vertex set of G. 
The packing is called vertex-disjoint if Ti, ...,Tfe share no vertices and is called 
edge-disjoint if we allow Ti, ..., to have some vertices in common but no edges 
exist in Ti C^Tj when i ^ j .In this paper we look at the parameterized version of 
the edge-disjoint case. More formally, we will study in detail the parameterized 
complexity of the following problem: 

fc-EDGE Disjoint Triangle Packing (ETP) 

Instance: A graph G = {V,E), a positive integer k 

Parameter: k 

Question: Are there at least k edge disjoint instances of T in G? 

This problem is NP-hard for general graphs [HOL81] and has also been shown 
to be NP-hard for planar graphs, even if the maximum degree is 5 or more. 
Regarding approximability, ETP is known to be APX-hard [K94]. A general 
result of [HS89] leads to a polynomial time (3/2 -|- e) approximation algorithm 
for any e > 0 for this problem. If G is planar Baker [B94] gives a polynomial 
time approximation scheme for the vertex-disjoint case which can be extended to 
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solve ETP. Caprara and Rizzi give a polynomial time algorithm for ETP when 
the maximum degree of G is < 4 and prove that the problem remains APX-hard 
even for planar graphs with maximum degree 5 or more [CROl]. The similar 
problem of finding k vertex-disjoint triangles (NTP) has also been studied from 
a parameterized complexity point of view achieving a cubic kernel [FHRST04] . 

2 Parameterized Complexity and Kernelization: The 
Method of Coordinatized Kernels 

2.1 The Method 

Definition 1. A parameterized problem L is kernelizable if there is a parametric 
transformation of L to itself that satisfies: 

1. The running time of the transformation of (x,k) into {x',k'), where |a;| = 
n, is bounded by a polynomial q{n,k) (so that in fact this is a polynomial 
time transformation of L to itself considered classically, although with the 
additional structure of a parametric reduction), 

2. k! < k, and 

3. \x'\ < h{k), where h is an arbitrary function. 

We define the size of the kernel for L to be |^(fc)|- 

It is important to note the following result by [DFS99] : 

Lemma 1. A parameterized problem L is in FPT if and only if it is kernelizable. 

Proof of Lemma 1. Based on ideas from [CCDF97] □ 

From this definition the goal would obviously be to try to produce reduction 
rules to reduce the size of the instance. Then, simply trying an ‘intelligent’ brute 
force to the remaining instance would suffice to solve the problem. Obviously, 
the smaller the kernel is, the better algorithm we get. 

In the method of coordinatized kernels, the general aim is to prove a kernelization 
lemma. In the case of solving a graph maximization problem such as /c-Edge 
Disjoint Triangle Packing, this lemma has the following form: 

Lemma 2. (General Kernelization Lemma) If G = {V,E) is an irreducible in- 
stance of a problem II and G has more than f{k) vertices, then (G, k) is a 
Yes-instance for the problem. 

To prove such a lemma, we still have to set up a list of reduction rules R (leading 
to a specific notion of irreducibility) and a suitable bounding function /(fc). 
These reduction rules will reduce the size of the instance without destroying any 
optimal solution. The rules can always be performed in polynomial time. 

Our techniques for deriving a smaller kernel are based on: 

— the polynomial time reduction rules mentioned above, and 

— combinatorial extremal arguments. 
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The combinatorial extremal arguments help us obtain structural claims in the 
context of a so-called boundary lemma. This boundary lemma takes the following 
form: 

Lemma 3. (General Boundary Lemma) Let G be R-irreducible and let (G,k) 
be a Yes-instance and {G,k -1-1) be a Uo-instance for LI. Then the number of 
vertices of G is less than g{k) for some function g. 

Once the boundary lemma is proved, the kernelization lemma follows by simple 
contradiction. 

This method of coordinatized kernels has already been used to tackle many 
other graph maximization problems such as /c-Nonblocker [DFFPR03], k- 
Max Leaf [FMRSOO] and /c-Star Packing [FPS03] proving to be equally as 
successful as with fc-EoGE Disjoint Triangle Packing. But its usefulness has 
proven also important with minimization problems such as /c-Graph Modifica- 
tion Problem [FL03] and even with non-graph problems like Set Splitting 
[DFR03]. 

2.2 The Crown Rule 

Traditionally, most reduction rules used to reduce the size of the problem kernel 
were limited to reduction rules that were based on the structure of a fixed number 
of vertices. For example, given an instance (G, k) of the Vertex Cover problem 
which has a vertex v of degree 1 with N(v) = u, we can simplify the problem by 
reducing the instance from (G, k) to (G', k') where G' = G\ {u, u} and k' = k—1 
without losing any information on the solution set of the problem. 

However, other methods to simplify the problem exist that are not based on 
the structure of a fixed number of vertices. In [CFJ03] another reduction rule is 
introduced that uses a structural observation based on the following definition. 

Definition 2. A Grown Decomposition of a graph G = {V, E) is a tripartition of 
the vertices of the graph into three sets H, C and X with the following properties: 

1. C is an independent set, 

2. H is a cutset in V such that there are no edges between C and X. 

3. H is matched into C . 

The Grown Decomposition is applied to both SAViNG-fc-COLORS and Vertex 
Cover. An instance of the Vertex Cover problem (G,k) that contains a 
crown decomposition, can be reduced to (G', k') where G' = G \ H \ C and 
k' = k — \H\ by removing the two sets H (the Head) and G (the Crown) of 
vertices, as there is an optimal cover for (G, k) which contains all the vertices in 
H and none of the vertices in C [CFJ03]. 

Here the Crown Decomposition is used to generalize the simple reduction rule 
for a degree 1 vertex (which is the same for both problems) to one that admits 
a global structural generalization [F03], as each edge in the matching must be 
covered by the vertex that is connected to the rest of the graph. Further, it 
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can be shown that by repeatedly applying the crown reduction, a linear kernel 
can be obtained in polynomial time. For both problems, Vertex Cover and 
Saving-/c-Colors, a 3k kernel can be achieved [CFJ03]. 

It has been shown that the structural nature of crown decomposition based rules 
allows them to be applied faster than other computationally expensive techniques 
such as Linear Programming or Network Flow algorithm [ALSS03] . In addition, 
the simple structure of the crown decomposition implies that it can be adapted 
to other problems. This has already become a useful tool to obtain significantly 
smaller (often linear) sized problem kernels. 

The following theorem given in [CFJ03] can be used to bound the size of the 
graph based on the size of the independent set for which we can find a Crown. 
This is expressed here in a slightly more general form. 

Lemma 4. If a graph G = {V, E) has an independent set I C V (G) such that 
|-^(.^)| < |.^| then G has a crown. 

In this paper we describe how a slightly modified version of the crown decom- 
position can be used to obtain a linear kernel for the FTP problem using an 
auxiliary graph module. Other modifications of the crown rule can be found in 
[FPS03] [FHRST04] [PS03]. 

3 The Method on fc-EDGE Disjoint Triangle Pagking 

As mentioned in section 2.1, a list of reduction rules for the problem must be 
produced. These reduction rules are all computable in polynomial time on the 
size of the instance and transform G into another graph G' with the property 
that G has a solution of size k if and only if G' has a solution of size k' where 
k' <k and in general |G'| < |G|. 

The kernelization and boundary lemmas will be proved in section 3.2. 

3.1 Reduction Rules 

For each Reduction Rule described below we assume that none of the previous 
Reduction Rules can be applied to (G, k) anymore. 

Reduction Rule 1 If there exist u,v,w G V such that u and v have degree 
2, and (u,v), (u,w), (v,w) € E, then G is a Yes-instance o/ fc-EoGE Disjoint 
Triangle Packing if and only if (G' = G \ {m, f }, k — 1) is a Yes-instance. 

Proof of Reduction Rule 1. The vertices and w form a dangling 
triangle which will always be part of any optimal solution of fc-EDGE 
Disjoint Triangle Packing. □ 

Reduction Rule 2 If a vertex v is not part of any triangle then (G, k) is a 
Yes-instance of fc-EoGE Disjoint Triangle Packing if and only if (G' = 
G\ {v},k) is a Yes-instance o/fc-EoGE Disjoint Triangle Packing. 
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Proof of Reduction Rule 2. Trivial, as v will never participate in any 
triangle packing. □ 

Reduction Rule 3 If an edge e is not part of any triangle then (G, k) is a 
Yes-instanee of fc-EoGE Disjoint Triangle Packing if and only if (G' = 
G\ {e},/c) is a Yes-instance o/ fc-EoGE Disjoint Triangle Packing. 

Proof of Reduction Rule 3. Trivial, as e will never participate in any 
triangle packing. □ 

We say that an edge (u, v) is spanned by a vertex w if (u, w) and {v, w) are in 
E. A set V of vertices spans a set E' of edges if for every e G E' there exists a 
vertex in V' spanning e. We will denote E' as 5'(y'). 

Reduction Rule 4 (Fat-Head Crown Rule) Assume the following structure: 

1. C, an independent set of vertices. 

2. H, a set of edges spanned by C, i.e. S{C). 

3. X, the rest. 

4. The endpoints of the edges in H, V{H) form a cutset such that there are no 
edges from C to X \ V (H). 

5. f, an injective function mapping each edge (hi, ft- 2 ) G H to a unique pair of 
edges {h\,u), (ft- 2 , u) where u G C. 

Then (G, k) is a Yes-instance of fc-EDGE Disjoint Triangle Packings/ and 
only if{G\ {CUH),k— \H\) is a Yes-instance o//c-Edge Disjoint Triangle 
Packing. 

Proof of Reduction Rule 4. 

First note that V{H) forms a cutset with no edges from C to X \ V {H) 
as every edge in the graph must be part of a triangle and if there existed 
an edge from C to X -\V{H) that edge would not be part of a triangle 
and thus the graph would not be reduced under Reduction Rule 3. 

We will prove the two implications of the if and only if separately: 

=^: / assigns a triangle to each edge in H. This edge can only be used 
in a single triangle in the packing. Further, the maximum number of 
triangles we can possibly add to the packing with the elements of H 
is \H\. With our construction we achieve this goal using only elements 
in G. Note that the elements in G are independent, and therefore can 
only form triangles with the edges in H. It is clear that by choosing 
the triangles formed with G and H we can do no worse than choosing 
triangles formed with H and X, as using the vertices in G allows the 
possibility of extra triangles to be formed in X . 

<J=: Assume we have a set of |i? | non-edges^ between vertices in the graph. 

If we add an independent set G of vertices spanning these non-edges plus 
all the edges missing then we will get a new solution with precisely \H\ 
more triangles. □ 



^ A non-edge is a pair of vertices u,v in G such that (ti, v) ^ E 
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Lemma 5. Let I he an independent set of vertices in G = (V,E) such that 
1^1 > |5'(^)|- Then there exists a fat-head crown, i.e. a partition (C,H,X) of G 
and a function f constituting a fat-head crown as described in Reduction Rule 4- 

Proof of Lemma 5. Let G be a graph with an independent set / such that 
1^1 > |5'(^)|- From G = {V, E) we construct a graph model G' = {V',E') by 
altering G as follows: 

— For each edge e G S{I) add a vertex v to V . 

— For every u in / spanning e, add {u,v) to E' . 

— Add an edge from v to the endpoints of e. 

— Remove the edges from the endpoints of e to u from E' . 

By our construction it is clear that if |/| > |S'(/)| in G then \I'\ > |fV(/')| in 
G' and by Lemma 4 there exists a crown in G', where I' is the vertices of G' 
corresponding to / in G. 

If G' has a crown decomposition {G' , H' , X'), then G has a fat-head crown 
{C,E[,X), where: 

— G = G' is an independent set by definition. 

— H is the set of edges represented by the vertices in H' excluding its endpoints. 

— / corresponds to the matching between G' and H' (by the nature of the 
construction of G'). 

— A is the rest of the graph. 

Thus, if |/| > \S{I)\ there exists a fat-head crown decomposition. □ 

3.2 Kernelization and Boundary Lemmas 

Lemma 6. (Kernelization Lemma) Lf G = (V, E) is reduced under reduction 
rules 1-4, and \V\ > 4fc, then G is a Yes-instance for /c-Edge Disjoint Tri- 
angle Pagking. 

Before we prove this lemma we need the following result. 

Lemma 7 . (Boundary Lemma) Lf G = (V, E) is a Yes-instance of the /c-Edge 
Disjoint Triangle PAGKiNGproWem for parameter k, a No-instance for pa- 
rameter A: -I- 1, and G is reduced under reduction rules 1-4, then \V\ < 4/c. 

Proof of Lemma 7 . (By minimum counterexample). Assume for contradiction 
that there exists a counterexample G', such that G' is reduced and contains a 
/c-Edge Disjoint Triangle Pagking, but there is no packing of the edges of 
G allowing A: -I- 1 triangles. Assume further that |E(G)| > 4A:. 

Let us consider the following witness structure as a partition of V : 

— P, an edge-disjoint triangle packing of size k plus all edges between triangles 
in the packing {ghost edges). Note that P contains less than 3k vertices. 

— O, the rest {outsiders). 
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Structural Claims: 

Claim 1 There are no triangles in O. 

Proof of Claim 1. If there existed a triangle in O, then we could increase 
the size of the packing by 1, contradicting our assumption that G is a 
No-instance for {k + 1)-Edge Disjoint Triangle Packing. □ 

Claim 2 < O >\edges= 0 (he. O is an independent set). 

Proof of Claim 2. Assume there exists an edge (ui, U 2 ) for some U\,U 2 G 
O. Then by reduction rule 3, it must be part of some triangle, and by 
claim 1, the triangle is not in O. So there exists a vertex v G P that 
forms this triangle. Then we have a triangle that is not in the packing, 
so we could increase out packing by 1, contradicting our assumptions. □ 



Claim 3 For all u G O, u does not span any ghost edges in P. 

Proof of Claim 3. Assume there was such a vertex. Then we would have 
a triangle that is not included in the packing, and we could increase the 
packing by 1, contradicting our assumptions. □ 



Claim 4 For every triangle {u, v, w) G P, if one edge is spanned by x G O, then 
there is no y G 0,y ^ x such that y spans another edge in (u,v,w). 

Proof of Claim ). Assume that we had such an arrangement. Then, 
without loss of generality, assume x spans the edge (a, v) and y spans 
the edge (v,w). We can replace (u,v,w) G P with two triangles (u,v,x) 
and (v, w, y), and thus increase the size of the packing by 1, contradicting 
our assumptions. □ 



Claim 5 If a vertex x G O spans more than one edge in a triangle^ {u,v,w) G P, 
then it is the only vertex that spans any edge in (u,v,w). 

Proof of Claim 5. This is simply a special case of claim 4. □ 

From now on we consider a partition of P into three subsets Pq, P\ and Pa 
defined as follows: 

— Pq\ triangles in P with no vertices in O spanning any of their edges. 

— Pi ', triangles in P which have exactly one edge spanned by vertices in O. 

— Pa', triangles in P which have all edges spanned by some vertex in O. 

Note that if a vertex spans more than one edge of a triangle, it must span all three 



2 
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We define the number of triangles in Pq to be s , the number of triangles in P\ 
to be I and the number of triangles in Pa to be k — I — s. 

By this definition we can claim: 

Claim 6 Let Oq = A^(Po)|o> l*^o| = 0- 

Claim 7 Let Oa = N{Pa)\o, then \Oa\ < k — I — s(< k — s'). 

Proof of Claim 7. By claim 5, each triangle in Pa must have precisely 
one neighbor in O, also note that a vertex in O may completely span two 
triangles Pa- Then as \Pa\ = k — l — s (by definition), \Oa\ < k — l — s.D 

Claim 8 Let Oi = N{Pi)\^_^^, then |Oi| < 1. 

Proof of Claim 8. By definition of Pi , only one edge in each triangle in 
Pi is spanned by vertices in O. There are precisely I triangles in Pi, and 
by lemma 5, if |Oi| > I, we would have reduced by reduction rule 2.2 
(the crown rule). □ 



Claim 9 |0| = \Oa U Oi| = \Oa \ + |Oi| — \Oa nOi| < l + k—l — s = k — s < k. 

Proof of Claim 9. As the number of vertices in Oq is 0 (claim 6), \0\ = 

\Oa U Oi I . The rest follows trivially from the definitions of set operations 
and claims 7 and 8. □ 



Claim 10 \V\ < 3k + k = Ak. 

Proof of Claim 10. By definition |y| = |P| + |0|. □ 

Thus from claim 10, we have that the total size of |y| is at most 4fc contradict- 
ing our initial assumptions and thus we have the proof of the boundary lemma. □ 

Proof of Lemma 6. Assume in contradiction to the stated theorem that there 
exists a reduced graph G of size |1^(G)| > Ak but has no /c-Edge Disjoint 
Triangle Packing. 

Let k' < k he the largest k' for which G is a Yes-instance. By the Boundary 
Lemma 7 we know that |E(G)| < Ak' < Ak. This contradicts the assumption. □ 

4 Description of the Algorithm and Running Time 
Analysis 

In the previous section we obtained a kernel for the problem of size Ak on the 
number of vertices. Now, we are in a position to apply the following algorithm 
to this reduced instance to achieve the claimed running time. The algorithm 
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is nothing but a brute-force approach on the 4fc kernel: it first tries all possible 
collections of vertices that could produce a /c-Edge Disjoint Triangle Pagk- 
ING and then, for each one of these collections, tries all subsets of 3fc edges. The 
number of vertices which could possibly give a /c-Edge Disjoint Triangle 
Packing less than 3k as shown in section 3.2. The number of possible combi- 
nations of 3k edges, considering that we have at most 3k vertices in a collection, 

will be always less than ) . 



Step 1. Generate all subsets of vertices that are 

candidates to form an edge-disjoint triangle 
packing. 

Step 2. For each of these subsets find all possible 
fc-triangles by testing every subset of edges 
of size 3k. 

Step 3. If the answer is Yes for one of the subsets of 
edges in step 2, answer Yes and halt. 

Step 4. Else answer No and halt. 



The running time of this algorithm is the following: 



3k 

E 

z=0 



< 



{3ky 

3k 



3k 



2=0 



4k 






i=0 



= (3fc)^ • 2^^ = 

It is to be noted that we could achieve single exponential running time for k- 
Edge Disjoint Triangle Packing. The algorithm to obtain this running 
time uses a color coding technique, also known as hashing [AYZ95]. The idea 
is to use dynamic programming on a 3k coloring of the edges of the graph. For 
each of the colorings we generate a table which has an entry for every set S of 
colors whose size is a multiple of 3. The entry for S, 

f 1 if 3 S" C S', such that \S'\ = |S| - 3, S' = S \ {u, v, w} 
answer{S) = < and answer(S') = 1 
[ 0 otherwise 

The running time of this algorithm is ■ #7f), where is the number of 

Hash Functions, which is of the order of c^. Unfortunately c is a huge constant, 
making this algorithm unfeasible in practical terms, for more details on this type 
of technique for packing problems we refer the reader to [FK-l-04] . 



5 Conclusions and Further Research 

To pack k edge-disjoint triangles, we utilize two of the newest techniques in 
parameterized complexity, namely the method of coordinatized kernels and crown 
type reduction rules. 
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In section 3.1 the use of polynomial time reduction rules, especially the crown 
rule, prunes the instance to the manageable size of 4/c vertices. Here k is the 
number of triangles we are trying to pack, which is smaller than \E\. The method 
provides us with a systematic tool to analyze the kernel size obtained by these 
reduction rules. 

We achieve a running time for the triangle packing case that could be extended 
to cycles of length greater than three using the same technique. Probably a 
modification of the crown reduction rule we use in this paper would be needed 
in the more general case of /c-Edge Disjoint Cycle Packing: 

fc-EoGE Disjoint Cygle Pagking 
Instance: A graph G = (V,E), a positive integer k 
Question: Are there at least k edge disjoint instances of Cr in G? 

The choice of parameter in this case is not as clear. We could either choose only 
k as in the case of fc-EoGE Disjoint Triangle Pagking or both k and r, 
the length of the cycle. The problem is still NP-complete [HS89] and APX-hard 
[K94] and has a great number of applications in computational biology [BP96]. 
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Abstract. The problem of packing k vertex-disjoint copies of a graph H into an- 
other graph G is NP-complete if H has more than two vertices in some connected 
component. In the framework of parameterized complexity we analyze a particu- 
lar family of instances of this problem, namely the packing of stars. We prove that 
packing k copies of H = TTi.s is fixed-parameter tractable and give a quadratic 
kernel for the general case. When we consider the special case of s = 2, i.e. H 
being a star with two leaves, we give a linear kernel and an algorithm running in 
timeC)*(2®“). 



1 Introduction 

The problem of Maximum iJ-MATCHiNG, also called Maximum iT-PACKiNG, is of 
practical interest in the areas of scheduling [BM02], wireless sensor tracking [BKOl], 
wiring-board design and code optimization [HK78] and many others. 

The problem is defined as follows: Let G = (V, E) be a graph and H = {Vh, Eh) 
be a fixed graph with at least three vertices in some connected component. An H- 
packing for G is a collection of disjoint subgraphs of G, each isomorphic to E[. In an 
optimization sense, the problem that we want to solve would be to find the maximum 
number of vertex disjoint copies of El in G. The problem is NP-complete [HK78] when 
the graph H has at least three vertices in some connected component. Note that in 
the case where El is the complete graph on two nodes iT-packing is the very well 
studied (and polynomial time solvable) problem Maximum Matching. Maximum 
TT-PACKING has been thoroughly studied in terms of approximation. The problem has 
been proved to be MAX-SNP-complete [K94] and approximable within |Vff |/2 + £ for 
any £ > 0 [HS89]. Several restrictions have also been considered (planar graphs, unit 
disk graphs etc.) in terms of the complexity of their approximation algorithms. For a 
review of these we refer the reader to [AC99]. In parameterized complexity a recent 
result by [FHRST04] gives a general algorithm for packing an arbitrary graph H into 
G. Their result gives a log |h|) algorithm for the general case. 

We discuss the parameterized complexity of the Maximum TT-packing problem 
for the case when H belongs to the restricted family of graphs T = ATi,s, a star with s 
leaves. More formally: 



R. Downey, M. Fellows, and F. Dehne (Eds.): IWPEC 2004, LNCS 3162, pp. 138-148, 2004. 
© Springer- Verlag Berlin Heidelberg 2004 
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iLi,s-PACKING 

Instance: Graph G = (V, E), a positive integer k 

Question: Are there at least k vertex disjoint instances of ATi ^ in G? 

This problem has already been studied within the framework of classical complexity 
theory [HK86]. In their paper, Hell and Kirkpatrick studied the complexity of packing 
complete bipartite graphs into general graphs. We include a brief introduction to this 
topic in Section 2. In Section 3 we show that the general problem is tractable if pa- 
rameterized, and that we can obtain a quadratic kernel. In Section 4 we show that the 
special case of packing iLi, 2 ’s has a linear kernel, and in Section 5 we give a quick 
algorithm for both the general and special case. In contrast [FHRST04] obtains only a 
0{k^) for packing a graph with three vertices, namely K^. Finally we conclude with 
some directions for further research. 



2 A Brief Introduction to Parameterized Complexity 

In classical complexity theory, a decision problem is specihed by two elements: the 
input to the problem and the question to be answered. In parameterized complexity a 
third element of information is added, the parameter. Also, in classical complexity the- 
ory, the notion of ’good’ behavior is related to polynomial time solvability, i.e. if there 
is an algorithm solving the problem in polynomial time on the size of the instance. In 
parameterized complexity a problem is said to have a “good” behavior if the combina- 
torial explosion can be limited to the parameter, i.e. it is hxed-parameter tractable. 



2.1 The Two Races 

Currently, in Fpt algorithm design there are two races [F03]. Firstly, to obtain the best 
running time, is the obvious race as it derives straight from the definition of fixed- 
parameter tractability. Here the emphasis is put in the function /. We will adopt the 
G*(/(fc))-notation for running time, where the polynomial part of the function is ig- 
nored. Generally the polynomials will be negligible compared to f{k). 

Improvements in the algorithms in this race may arise from better preprocessing 
rules, better branching strategies or more sophisticated methods of analysis. 

Another race is the kernelization race, which is the one we contribute to (mainly) in 
this paper. The kernel of a problem is dehned as follows: 

Definition 1. A parameterized problem L is kernelizable if there is a parametric trans- 
formation of L to itself that satisfies: 

1. The running time of the transformation of {x,k) into {x' ,k'), where |x| = n, is 
bounded by a polynomial q{n, k) (so that in fact this is a polynomial time transfor- 
mation ofL to itself, considered classically, although with the additional structure 
of a parametric reduction), 

2. k' < k, and 

3. \x'\ < h{k'), where h is an arbitrary function. 
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From this definition the goal would obviously be to try to produce better reduction 
rules to reduce the size of the instance as much as possible. It is important to note the 
following result by [DFSOO], which gives us the link between the two race: 

Lemma 1. A parameterized problem L is in Fpt if and only if it is kernelizable. 

The two races are worth playing as they may lead to substantial improvements on 
the quality of the algorithms we design and also to new strategies for practical imple- 
mentations of these algorithms. 

2.2 Preliminaries 

We assume simple, undirected, connected graphs G = (V,E) where \V\ = n. The 
neighbors of a vertex v are denoted as the set N (u), and the neighbors of a set S' C V, 

mS) = U.es^(^)\S- 

The induced subgraph of S C y is denoted G[S]. 

We use the simpler G \ v to denote G = (y \ {u}, E) for v a vertex and G \ e to 
denote G = (V, E \ {e}) for e an edge. Likewise G\V' denotes G = (V \ V' , E) and 
G\ E' denotes G = {V,E\ E') where V' is a set of vertices and E' is a set of edges. 

We say that Ffi ^ is a s-star or a star of size s. Pi denotes a path of i vertices and 
(i — 1) edges. 



3 Parameterized Complexity of Star Packing 

In this section we are going to prove a series of polynomial time preprocessing rules 
(reduction rules) and eventually show that we can obtain a kernel of quadratic size on 
the parameter k for the parameterized version of ATi ^-packing. 

We use the following natural parametrization of ATi g- PACKING: 

fc-Ffi,s-PACKING 

Instance: Graph G = (y, E) 

Parameter: k 

Question: Are there k vertex disjoint instances of ATi ^ in G? 

To remove vertices of high degree and remove useless edges between vertices of 
low degree we introduce the following reduction rules. 

Lemma 2. Let G be a graph such that G y, deg{v) > k{s -I- 1) — 1. G has a 
k-Kis-packing if and only ifG' = G\v has a {k — \)-Kig-packing. 

Proof If G has a fc-Ffi ^-packing then it is obvious that G' has a (A: — l)-ATi_s as v 
cannot participate in two different stars. 

If G' has a (A: — Ij-iFi^s-packing we can create a A:-A'i ^-packing by adding v. The 
A: — 1 stars already packed cannot use more than (s-l- 1)(A: — 1) of u’s neighbors, leaving 
s for V. 
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Lemma 3. Let G be a graph where 3u, v G V (G), uv G E{G) and deg{u) < deg{v) < 
s. G has a k-packing if and only ifG' = G\ uv contains a k-packing. 

Proof. If G has a fc-iLi ^-packing then it is obvious that G' has a /c-iLi g-packing as uv 
can never participate in a iLi g. 

If G' has a fc-iLi_s-packing it is obvious that G has a /c-iLi g-packing as well. 

To prove that the fixed parameter version of A: -Star PACKING is tractable we will 
use a new technique first seen in [FMh- 00]. This technique borrows ideas from extremal 
graph theory. In essence we look for a minimum counterexample represented by a wit- 
ness structure and with the aid of reduction rules produce a series of structural claims 
that help us find a bound on the size of the instance produced by those reduction rules. 
This technique has proven to be a very potent tool in Fpt -algorithm design, both be- 
cause of its strength and also because of its ease of use. 

We try to establish a bound where it is guaranteed that any graph larger than this 
bound has a fc-iLi ^-PACKING, thus implying a kernel for the problem. We will prove 
this fact, labelling it kernelization lemma, at the end of this section. 

Let a graph be reduced when lemmas 2 and 3 are no longer applicable. In this sense 
both these lemmas will be commonly referred to as reduction rules. 

Lemma 4. ( Boundary Lemma ) 

If a graph G is reduced and has a k-Ki g-packing, but no {k + l)-Ki s-packing 
then \V (G)| < fc(s^ -I- ks'^ -I- fcs -I- 1). 

Proof. Assume there exists a counterexample G, such that G is reduced and contains a 
/c-ATi^s-packingVF, butno(fc-|-l)-iLi_s-packingandsize |F(G)| > k{s^+ks'^+ks+l). 

Let Q be y \ pp. Let Qi be the vertices in Q that have degree i in the subgraph 
induced by Q. We will now prove a series of claims that bound the number of vertices 
in Q. 

Claim 1. Wi > s,Qi = ib 

Proof of Claim 1. This is clear as otherwise W could not be maximal. 

□ 

Claim 2. A Ki^g-star S GW has at most + k{s -I- 1) — 1 neighbors in Q. 

Proof of Claim 2. Assume in contradiction that there is a reduced graph 
with a star S that has more than -I- k{s -I- 1) — 1 neighbors in Q. Let 
u C S' be the vertex in S that has the highest number of neighbors in Q. 

From Lemma 2 we have that v has at most /c(s -I- 1) — 1 neighbors. Let 
u be the vertex in S \ u that has the highest number x of vertices in Q \ 

N (v), it is clear that x < s otherwise u and v can form two new stars, 
contradicting the assumption that G has no {k + l)-packing. From this 
we conclude that at most one vertex in S has s or more neighbors in 
Q. This gives us that each star in W has at most -I- k{s -I- 1) — 1 
neighbors in Q. □ 



Claim 3. W has at most k ■ (s^ -I- k{s -I- 1) — 1) neighbors in Q. 
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This follows from Claim 2. 

Let R = V \ {W U N{W)) i.e. the set of vertices of Q which do not have 
neighbors in W. 

Claim 4. R is an independent set in G. 

Proof of Claim 4. Assume in contradiction that there exists two ver- 
tices u,v G R that have an edge uv G E between them. By Claim 1 
we know that both u and v have degree less than s, but then by Lemma 
3 we know that uv has been reduced, contradicting the assumption. □ 

Claim 4 ensures us that all vertices in R have an edge to one or more vertex in Q. By 
Claim 1 we know that each of the vertices in Q \ i? have at most s — 1 such neighbors 
and thus by Claim 3 we know that the total size of R is at most (s — 1) • |Q \ .Kl- 
in total, G has size |C(G)| = \W\ + |Q| < k{s + 1) + s • /c • (s^ + k{s + 1) — 
1) = k{s^ + ks^ + /cs + 1) contradicting the assumption that the graph had more than 
k{s^ + ks^ + fcs + 1) vertices. This concludes the proof of the boundary lemma. □ 

Having proved the boundary lemma we can now state that any reduced instance that 
is still ’big’ has a fc-ATi ^-packing. Since the boundary given by the Lemma 4 does not 
depend on the main input, but only on the parameter and the problem in question. We 
can say that the reduced instance is a ’problem-kernel’ and that the problem is in Fpt . 

Lemma 5. (Kernelization Lemma) If a graph G is reduced and has \V (G)| > fc(s^ + 
ks^ -f /cs -f 1), then it contains a k-Ki^g-pocking. 

Proof. Assume in contradiction to the stated theorem that there exists a graph G of size 
\V (G)| > k{s^ + ks^ -f fcs -f 1), but where G has no /c-ATi^s-packing. 

Let A:' < fc be the largest k' for which G is a YES-instance. By the Boundary 
Lemma 4 we know that |C(G)| < fc'(s^ + k' + k's -f 1) < k{s^ + ks^ -f /cs -f 1). 
This contradicts the assumption. □ 

Thus for any A: -ATi s -packing we can prove a quadratic kernel. However, for the 
special case s = 2, we can improve on this. This is the topic of the next section. 

4 The Special Case of P 3 : A Linear Kernel 

A 2-star can also be seen as a path with three vertices, denoted P3. For this special case 
we can employ a different set of reduction rules to obtain a linear kernel for packing 
Pa’s into a graph. 

A;-P3-PACKING 

Instance: Graph G = (H, E) 

Parameter: k 

Question: Are there k vertex disjoint instances of P3 in G? 

To improve on the quadratic kernel obtained in the previous section, we will make 
use of a series of reduction rules based on the ideas of crown decompositions [CFJ03]. 
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Definition 2. A crown decomposition {H, C, R) in a graph G = {V, E) is a partition- 
ing of the vertices of the graph into three sets H, C, and R that have the following 
properties: 

1. H (the head) is a separator in G such that there are no edges in G between vertices 
belonging to G and vertices belonging to R. 

2. G = Gu U Cm (the crown) is an independent set in G. 

3. I Cm I = \H\, and there is a perfect matching between Cm ond H. 

There are several recent papers that use crown decompositions of graphs to obtain 
good results in parameterized complexity [CFJ03, FHRST04, F03, ACFL04, PS04]. 
[CFJ03, ACFL04] solving VERTEX COVER can use the crown structure directly. The 
others have to either modify the input graph or create an auxiliary graph where a crown 
decomposition will lead to a reduction in the size of the input graph. 

In this paper we instead modify the crown decomposition to fit our particular prob- 
lem. The first variation is ‘double crown ’-decomposition^ where each vertex in H has 
two vertices from C matched to it (as opposed to only one). 

Definition 3. A double crown decomposition (iF, C, R) in a graph G = (V, E) is a 
partitioning of the vertices of the graph into three sets El, C, and R that have the 
following properties: 

1. E[ (the head) is a separator in G such that there are no edges in G between vertices 
belonging to C and vertices belonging to R. 

2. C = Cu U Cm U Cm 2 (the crown) is an independent set in G. 

3. |Cm| = \H\, |Cm 2 | = \H\ and there is a perfect matching between Cm ond El, 
and a perfect matching between Cm 2 ond El. 




Fig. 1. Example of ‘double crown’ 



Another variation of the crown is the ‘fat crown ’-decomposition‘s where instead of 
independent vertices in C we have AT 2 ’s as shown in hgure 2. 

^ The dashed lines in the figure indicate how each vertex in H is matched to two vertices in C. 

‘S As in the case of the ‘double crown’, the dashed lines indicate the matching between H and 
Cm and the dashed ellipses show which K2 the vertex in H is matched to. 




144 



Elena Prieto and Christian Sloper 



Definition 4. A fat crown decomposition {H, C, R) in a graph G = (V, E) is a parti- 
tioning of the vertices of the graph into three sets H, C and R that have the following 
properties: 

1. H (the head) is a separator in G such that there are no edges in G between vertices 
belonging to G and vertices belonging to R. 

2. G[G] is a forest where each component is isomorphic to K 2 

3. \C\ > \H\, and if we contract the edges in each K 2 there is a perfect matching 
between C and H. 




Fig. 2. Example of ‘fat crown’ 



Using the ‘crown’, ‘double crown’ and ‘fat crown’ we can create powerful reduction 
rules. 

Lemma 6. A graph G = (V, E) that admits a ‘double crown ’ -decomposition (H, C, R) 
has a k-P^-packing if and only ifG \ {El U G) has a {k — \H\)-P 3 -packing. 

Proof. (4=:) If G \ {H U G) has a (/c — |Ef |)-P 3 -packing then it is obvious that G has 
a fc-Ps-packing as P U G has a |P|-P 3 -packing (v G H and v’s matched vertices from 
Gm and G „2 form a P 3 ). 

(=!>:) We want to prove that if G has a k-Ps-packing then G \ {H U G) has a 
{k — |P|)-P 3 -packing. Assume in contradiction that there exists a graph G' that has a 
crown-decomposition {H' , G', R') that contradicts the lemma. This implies that H'UG' 
participates in a; > \H'\ Pfs. Since H' is a cutset, and G is an independent set in the 
graph, every P 3 in G that has vertices in H' U G' must have at least one vertex from 
H' . Thus we can have at most \H'\ Pfs which is a contradiction. □ 

Lemma 7. A graph G = {V, E) that admits a ‘fat crown’ -decomposition (P, C, R) 
has a k-P^-packing if and only ifG \ (P U G) has a {k — \H\)-P 3 -packing. 

The proof of Lemma 7 is analogue to Lemma 6 , thus omitted. 

To apply crown-decompositions we need to know when we can expect to hnd one. 
A very useful result in this regard can be deducted from [CFJ03, page 7], and [F03, 
page 8 ]. Fortunately, the results also apply to the variations of crown decomposition 
described here. 
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Lemma 8. Any graph G with an independent set I, where |/| > |A'^(/)|, has a crown 
decomposition (iL, C, R), where H C N(I), that can be found in linear time, given I. 



Corollary 1. Any graph G with a collection J of independent K2S such that |-/V(<7) | < 

I J|, has a fat crown decomposition {H, C, R), where H C N{I), that can be found in 
linear time, given J. 

Proof. This follows from the previous Lemma. If we replace each K2 with a single 
vertex, then hy Lemma 8 this graph admits a crown-decomposition. We can reintroduce 
the K2S to obtain a ‘fat-crown’. □ 

Lemma 9. Any graph G with an independent set I, where |/| > 2 • |fV(/)|, has a 
double crown decomposition {H, G, R), where H C N (/), that can be found in linear 
time, given I. 

Proof. Let G be a graph with an independent set / C V{G) such that |fV(/)| < |/|. 
Create a graph with G' = G, but for every vertex v G N{I) add a copy v', such 
that N{v) = N{v'). By Lemma 8 G' has a crown-decomposition {H, G, R) such that 
H C Nc'{I). We now claim that we can use this crown to construct a ‘double crown’ 
{H',G',R') in G. 

First observe that v G H if and only if v' G H. Assume in contradiction that v' G H 
but v' H. V must be matched to some vertex u in G. Since N{v) = N{v') we have 
that v' cannot be in G as it would contradict that G is an independent set. Also v' is not 
in R as that would contradict that H is a cut-set. Thus v' must be in H, contradicting 
the assumption. 

With this observation the result follows easily as H consists of pairs of vertices, a 
vertex and its copy. Each pair v and v' in H is matched to two vertices ui and U2. In G, 
let V be in H' and let it be matched to both u\ and U2. Do this for every pair in H. It is 
easy to see that this forms a double crown in G. □ 

We will now describe a polynomial time preprocessing algorithm that reduces the 
graph to a kernel of size at most 15 fc. 



Step 1. Compute an arbitrary maximal Ps-packing 
W. Let Q = V\W. 

Step 2 . Let X be the components in G[Q] 

isomorphic to K2. If \X\ > |A^(AT)| in G 

then reduce by Lemma 7 . 

Step 3 . Let / be the isolated vertices I in G[Q] . 

If |/| > 2|A^(/)| in G then reduce by Lemma 

6 . 



Lemma 10. If\V{G)\ > Ibk then the preprocessing algorithm will either find a k-P^- 
packing or it will reduce G. 
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Proof. Assume in contradiction to the stated lemma that |F(G)| > 15fc, but that the 
algorithm produced neither a /c-Pa-packing nor a reduction of G. 

By the assumption the maximal packing W is of size \W\ < 3k. Let Q = V \ W. 
Let Qi be the vertices in Q that have degree i in the graph induced by Q. 

Claim 5. Vz > 2, Qi = 0 

Proof. This is clear as otherwise W could not be maximal. □ 



Claim 6. |Qi| < 6 /c 

Proof. Assume in contradiction that |Qi | > 6 fc. This implies that num- 
ber of K 2 ^ X in Q is greater than 3k, but then \X\ > |VL|. By Corol- 
lary 1 G has a ’fat crown’ and should have been reduced in step 2 of 
the algorithm, contradicting that no reduction took place. □ 



Claim 7. |Qo| < 6/c 

Proof. Assume in contradiction that |Qo| > 6 /c, but them |Qo| is more 
than 2\W\ and by Lemma 9 G has a ’double crown’ and by Lemma 6 
should have been reduced in step 3 of the algorithm, contradicting that 
no reduction took place. □ 



Thus the total size |C(G)| = \W\ + |Qo| + |Qi| + IQ2I + • ■ • < 3k + 6k + 6k + 0 = 
I5k. This contradict the assumption that \V (G) | > 15k. 



□ 



Corollary 2. Any instance (G, k) of P^-packing can be reduced to a problem kernel of 
size 0{k). 

Proof. This follows from the Lemma, as we can run the preprocessing algorithm until 
it fails to reduce G. By Lemma 10 the size is then at most 15fc. □ 

5 Running Time 

We will apply a straightforward brute-force algorithm on the kernels to find the opti- 
mal solution. In the case of P 3 -packing we will select the center- vertices of the P 3 S in a 
brute force manner. There are ways to do this. By Stirling’s formula this is approx- 
imately With fc-center vertices already selected the problem reduces to a bipartite 
problem where the question is if the left hand side each can have 2 neighbors assigned 
to it. This can easily be transformed to Maximum Bipartite Matching by making 
2 copies of each vertex on the left hand side. Maximum Bipartite Matching can 
be solved in time G(\/|C||P|) [HK73]. Since we now have 15fc + k vertices, and thus 
O(k^) edges. We can solve each of these in time 0{k^'^). Giving a total running time 
of 0{2^-^^k'^-^), or 0*{2^-^^) when written in O* notation. 

Applying the same technique for the s-stars we will achieve O* (2® asymp- 
totically worse due to the quadratic kernel. 
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6 Conclusions and Further Research 

Packing vertex-disjoint copies of a graph H into another graph G is NP-complete as 
long as H has more than two vertices [HK78]. We have analyzed within the frame- 
work of parameterized complexity a specific instance of this problem, the packing of 
vertex-disjoint stars with s leaves. We have proved that packing iLi 2 ® in a graph G, 
equivalently fc-Ps-PACKiNG has a linear kernel. 

Our algorithm for fc-Ps-Packing runs in time This running time arises 

from reducing the problem to a kernel of size 15k. We believe that this kernel can be fur- 
ther improved and thus the running time substantially decreased, however, it is already 
much better than k + k \ H \ log |tt|) tjje running time of the general algorithm in 

[FHRST04]. 

We have also proved that s-Star Packing (PTi g-Packing) is in general fixed-param- 
eter tractable with a quadratic kernel size. We proved a running time for this general 
case ofC)*(2‘^('=i°g'=)). 

An open problem here is whether it is possible to obtain a linear sized kernel for 
s-Star packing for any s > 3 such as the one obtained in the case of fc-Pg-PACKlNG. 

There are several related problems that could be considered on the light of the tech- 
niques used in Section 3. The most obvious one is the following: 

fc-iLi,s-PACKING 

Instance: Graph G = {V, E) 

Parameter: k 

Question: Are there k edge-disjoint instances of Ki s in G? 

This problem is fixed-parameter tractable when s = 2, 3 using Robertson and 
Seymour’s Graph Minor Theorem [RS99] since it can be easily proved that its No- 
instances are closed under the minor operations. The issue here is that this method is 
non-constructive and carries a fast growing function f{k). Possibly, applying similar 
arguments as those in Section 4 would lead to a much better running time. 

Acknowledgements. We would like to thank Mike Fellows for all the inspiring 
conversations leading to the completion of this paper. 
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Abstract. We consider an assembly line problem that occurs in varions 
kinds of production automation, for example, in the automated manu- 
facturing of PC boards®. The assembly line has to process a (potentially 
infinite) number of identical workpieces in a cyclic fashion. In contrast to 
common variants of assembly-line scheduling, the forward steps may be 
smaller than the distance of two stations. Therefore, each station may 
process parts of several workpieces at the same time, and parts of a 
workpiece may be processed by several stations at the same time. The 
throughput rate is determined by the number of (cyclic) forward steps, 
the offsets of the individual forward steps, and the distribution of jobs 
over the stationary stages between the forward steps. Even for a given 
number of forward steps and for given offsets of the forward steps, the 
optimal assignment of the jobs to the stationary stages is at least weakly 
A/’T’-hard. 

We will base our algorithmic considerations on some quite conservative 
assumptions, which are greatly fulfilled in various application scenarios, 
including the one in our application: the number of jobs may be huge, but 
the number of stations and the number of forward steps in an optimal 
solution are small, the granularity of forward steps is quite coarse, and 
the processing times of the individual items do not differ by several 
orders of magnitude from each other. We will present an algorithm that is 
polynomial and provably deviates from optimality to a negligible extent 
(under these assumptions). This result may be viewed as an application 
of fixed-parameter tractability to a variety of real-world settings. 
Keywords: assembly line balancing, printed circuit boards, fast compo- 
nent mounters, stepping problem, cycle time minimization, fixed-param- 
eter analysis 



1 Introduction 

Problem description A (potentially infinite) number of identical workpieces is to 
be processed by an assembly line. Several workpieces are simultaneously in the 

* This work has been partially supported by DEC grant MU 1482/2. 

® In cooperation with Philips/ Assembleon B.V., Eindhoven, the Netherlands. 
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Fig. 1. An assembly line with five stations. In this example, the (cyclic) stepping 
scheme consists of four stationary stages (the fifth stage is identical with the first 
stage). 



assembly line, and the offset of two workpieces is fixed. The stations (machines) 
of the assembly line must perform a set of specific jobs on each workpiece. Jobs 
have to be processed without preemption. Each job has a certain location on 
the workpiece. A job can only be performed if it is in the visibility region of 
the station on which it shall be executed. The visibility region of station i is an 
interval [Li . . . Ri] of the assembly line. See Figure 1. The process decomposes 
into stationary stages, in which the assembly line does not move forward. After 
a stationary stage is finished, the band (and thus all workpieces) are moved 
forward by some common offset, and the next stationary stage commences. 

In many application scenarios of flexible manufacturing the described model 
of an assembly line is the appropriate variant. For us, this problem arose as 
a subproblem in a cooperation with Philips/ Assembleon B.V., Eindhoven, the 
Netherlands. Here the workpieces are printed boards. Each station holds a robot 
arm, which places various components on the boards. Jobs correspond to mount- 
ing steps: picking a component from a feeder; moving from the pick-up position 
to the position of this mounting job; mounting the component; moving back to 
the feeder for the next mounting job. 

In the literature, typically, the following special case is considered: each work- 
piece moves from station to station, each station may access and process exactly 
one workpiece in a stationary stage (all its jobs on this workpiece in fact), and 
each workpiece can only be processed by one station at a time. 

Motivated by our concrete application, the scenario of this paper is a bit 
different: the stations are very close together, and forward steps may be relatively 
small. Therefore, one workpiece may be processed by several stations in the 
same stationary stage, and one station may process several workpieces in the 
same stationary stage. See Figure 1 again. All workpieces must be processed by 
an identical schedule. In particular, each workpiece must exactly take over the 
position of its immediate predecessor after a certain number s of forward steps. 

With respect to an initial start configuration, the movement of the assembly 
line is completely described by a stepping scheme, namely by the overall number 
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s of forward steps and the offsets of the individual forward steps. The stepping 
scheme determines the positions of the workpieces for each stationary stage. 
Due to the restricted visibility region of each station, a job can usually only be 
performed on a subset of the stationary stages. 

Now the optimization problem is this: we have to determine (i) a stepping 
scheme and (ii) for each job an assignment to a stationary stage where the 
location of this job on the workpiece is visible to the station of this job. We call 
the stepping scheme together with such an assignment a (cyclic) assembly-line 
schedule. The objective is to minimize the cycle time of the schedule, that is, 
the time for one cycle of the process. This means the sum of the process times of 
all s stationary stages plus the time for moving the assembly line between two 
stationary stages (and between the last and first stationary stage of a cycle). 

Previous work Scholl [Sch99] surveys general problems and approaches to the 
balancing and sequencing of assembly lines. Ayob et. al. [ACK02] compare dif- 
ferent models and assembly machine technologies for surface mount placement 
machines. A recent overview on printed circuit board assembly problems has 
been given by Crama et al. [CKS02]. 

Assigning jobs to stationary stages seems to resemble batching problems on m 
machines which appear in several versions but different from the one considered 
in this paper [Bru97]. The most important differences to our problem are the 
definitions of the objective function and of the completion times of jobs. In the 
literature, the completion time of a job equals the completion time of the whole 
corresponding batch. Moreover, the assignment to machines is usually not given. 

There is some engineering work on this kind of assembly- line problems 
However, we are not aware of any mathematical work which considers the prob- 
lem of our paper (or variants thereof), although the problem might be quite 
natural and definitely occurs in practice in the form presented here. 

Spirit of the paper: Our main research topic is the application of fixed-parameter 
tractability [DF99, AGNOl, Fel02] to real-world optimization problems from 
Operations Research and the investigation of concrete problems and techniques. 
In this paper, we present a result of this project. Roughly speaking, a problem is 
fixed-parameter tractable with respect to a given set of parameters, if there is an 
algorithm for this problem whose run time is polynomial in the input size in case 
the values of these parameters are bounded. The literature on fixed-parameter 
tractability is mainly concerned with the analysis of “classical” problems such as 
those in [GJ79] with respect to (more or less) natural parameters. Here, we apply 
fixed-parameter tractability to real-world problems in a systematic, conscious 
effort.® The challenge in applied work is different because we are not free in the 

^ Internal reports of manufacturers like Assembleon. 

® The underlying intuitive ideas have been significantly influencing applied work in 
virtually all application domains. Therefore, systematic approaches like ours might 
be valuable. However, we are not aware of any work of this kind, except that the 
research on “classical” problems includes some basic problems that are related to 
bioinformatics [BDF+95]. 
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choice of the parameters. In fact, the challenge is to find parameters that not 
only allow efficient algorithms but also turn out to have small values in typical 
instances from an application or from a variety of applications. 

Our technique is different from the common techniques like, for example, ker- 
nelizatiovP and search trees which are typically applied to derive fixed-parameter 
results. Here we will see that another approach is more promising for our opti- 
mization problem. 

Specific contribution of the paper: Empirically, processing times of individual 
mounting jobs differ by not too much. The maximum processing time of a single 
job is typically very small in comparison with the cycle time. 

In practice, the offsets of the forward moving steps cannot become arbitrar- 
ily small. More precisely, the size of an offset is bounded from below and is a 
multiple of some parameter A. For example, for printed circuit board machines 
as developed by Philips/ Assembleon, the precision is about A « .5mm. 

The parameters that turn out to have bounded values are these: (1) the 
number m of stations, (2) the total number s of forward steps in an optimal 
solution, (3) the ratio qi of the offset of two workpieces divided by the granu- 
larity of forward steps, (4) the ratio q 2 of the maximal divided by the minimal 
processing times of jobs, and (5) q^ := ■ m/2N, where N denotes the total 

number of jobs. The first three parameters will bound the run time, and the 
last two parameters will bound the error. More specifically, the run time will be 
0{{N -|-TO^s^) •minjlVs'^'^ , N^}), and the relative error will be bounded by 92 • 9s- 
Note that the relative error vanishes as the number N of jobs increases. As a by- 
product, it will turn out that our algorithm is at least 2-approximative without 
assumptions on the parameters <72 and (73, and that it can be transformed to a 
PTAS. For a given relative error £ > 0, the run time of the PTAS algorithm is 
0{{N + m^s^) ■ 

Discussion: According to the “spirits” paragraph above, these five parameters 
were not chosen arbitrarily, but in fact they were chosen because quite often in 
practice the values of these five parameters might be very small. For example, in 
the application that motivated our work, m is in the order of 10 . . . 20, s is in the 
order of 3 ... 5, <71 is in the order of dozens or hundreds, (72 is close to 1, and 53 
is orders of magnitude smaller than 1 because N is huge. Therefore, the relative 
error is negligible under the (quite realistic) assumptions that the processing 
times of the jobs do not differ by orders of magnitude from each other, and that 
the total number of N jobs is orders of magnitude larger than the number of 
forward steps. 

In practice, mm{N s'^^ , N^} might typically be assumed at the second term, 
N‘^ . However, the first term plays a role in theory: the fact that the run time is 
0{{N + m^s^) ■ A^s^i) means that the problem is fixed-parameter tractable in 
the sense of [DF99, AGNOl, Fel02] if a relative error of <72 • <73 is admitted. 



It can even be proved [DFS99] that every fixed-parameter result can be polynomially 
transformed into a kernelization result. 
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Remark: Additional precedence constraints among jobs are quite common in 
practice. Therefore, it is worth noting that our algorithm can cope with prece- 
dence constraints, and the asymptotic run time remains the same. 

Organization: In Section 2, we reduce our problem to a certain weakly NP-hard 
core problem which has to be applied several times. We prove error bounds on 
a relaxed version of this core problem and derive a PTAS. In the second part 
of this section, we develop an efficient “core algorithm” for the relaxed version. 
Section 3 is devoted to the correctness proof of this core algorithm, whereas 
Section 4 achieves bounds on the number of core problems to be considered. 

2 The Algorithm 

In the following, jobs are denoted by J, Ji, J 2 , etc. For a job J, £[J] and u[J] 
describe the intervals of stationary stages to which J could be feasibly assigned 
subject to the visibility constraints. The intervals u[J]] are to be interpreted 
as wrap-around intervals, that is, ii u < £, the notation [i, m] is a shorthand for 
[i,u] = 1, . . . ,s,l, . . . ,u — l,u}. 



2.1 Reduction to a Core Problem 

The core problem requires 0{N m^s^) time, and the algorithm for the core 

problem is applied 0(min{A^s^b -^*})) times. Both parts together give the total 
run time claimed above. The core problem differs from the original problem in 
two respects: 

1. The stepping scheme is fixed. In other words, it remains to compute the 
assignments of jobs to stationary stages. (Recall that the stepping scheme 
comprises the number s of forward steps and the offsets of the individual 
forward steps.) 

Already this assignment problem of jobs to stationary stages can be shown to 
be weakly AfP~hard by a straightforward reduction from PARTITION (the 
decision version of which is well-known to be weakly AfP-complete [GJ79]). 

2. Due to this hardness result, the problem is slightly relaxed in the following 
way: we do not necessarily assign each job to a single stationary stage. In- 
stead, a stationary stage may be finished while a job is being processed, so 
this job is resumed at the beginning of the very next stationary stage. In a 
postprocessing at the end of the algorithm, each such job is then assigned to 
exactly one stationary stage to which a part of the job was assigned by the 
core algorithm. 

Fixing the stepping scheme amounts to enumerating all possible stepping schemes. 
In Section 4 we will show that it suffices to consider 0(min{A^s®'^ , N^}) stepping 
schemes as claimed above. 
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Linear joh orderings: For each machine, we can define an interval-induced partial 
order on the jobs. In this partial order, job Ji is a predecessor of job J2 if and 
only if either ^[Jl] < d[J2] or f [Ji] = (-[J2] and u[Ji] < u[J2]- Visibility conditions 
in an assembly line ensure that no interval of a job is strictly contained in that of 
another job. Hence, two jobs J\, J2 are incomparable if and only if t[J\] = f[J2] 
and u[Ji] = u[J2]- 

Note that in the fractional model the precise order of two incomparable jobs is 
irrelevant for the overall cycle time. Hence, we may simply fix one linear extension 
of the interval-induced partial order for each machine. We call such an extension 
an interval-consistent linear order. For notational simplicity, we assume that 
jobs are numbered with respect to this order, i.e. Ji^i Ji_2 ^ ^ on 

machine i. 

Simplified instances: In the relaxed model, we may treat all jobs with the same 
interval of feasible stages as one larger job. Hence, a simplified instance will have 
at most O(s^) jobs on each machine, for a total of 0 (ms^) jobs over all machines. 

Error bound: The relaxation introduces an error. The following lemma will show 
that this error can be reasonably bounded. As discussed above, (72 • 9 s may be 
assumed to be orders of magnitude smaller than 1 . 

Lemma 1 . The relative error is at most min{l,(72 • (Zs}- 

Proof. Obviously, the absolute error is bounded by s ■ Pmax, where Pmax de- 
notes the maximal job length. First we will show that the absolute error is even 
bounded by s • Pmax/‘^- This follows from the before-mentioned postprocessing 
to convert a relaxed solution to an integral one. Namely, we decide iteratively 
for each stage where to assign the splitted jobs. For stage i, let split{i) denote 
the set of splitted jobs which start in stage i and are continued in stage z -I- 1 . 
Let p[J] be the length of a job and Pi[J] be the amount of time that job J is 
executed within stage z. Define 

Ti := max{pi[J] | J G split{i)} and T2 := max{p[J] — pi[J] \ J G split{i)}. 

If Ti > T2, then we assign all jobs in split{i) completely to stage z (which implies 
that the length of stage z increases by T2). Otherwise, we shift for all jobs in 
split{i) the portion which is executed in stage z to stage z -I- 1 . In this case, the 
length of stage z -I- 1 increases by at most Ti. (The increase might be strictly 
shorter if a shifted job fills up idle time.) 

Let CTfrac denote the optimal cycle time with respect to the relaxed model, 
and let CTapp be the cycle time of the approximation which we obtain by the 
above conversion routine. In each rounding step, the makespan of a stage can at 
most double. Hence, we immediately get 

CTapp < 2 • CTfrac- 

Thus, we obtain at least a 2 -approximation for the minimum cycle time, and it 
remains to show that the relative error is bounded by Q2 • 93. Denote by P the 
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sum of processing times over all jobs, and by Pmax and Pmin the maximal and 
minimal processing time over all jobs, respectively. Clearly, we also get 



CTapp < CTfrac 



s 

"^Pmax- 



To bound the relative error, we bound the optimal cycle time from below by 
the ratio of the total sum of all job lengths divided by m. The total sum of all 
jobs is the product of N and the average job lengths. Replacing the average job 
length by the minimal job length eventually gives an upper bound of <72 • Qs for 
the relative error as claimed in the introduction. 



If the number of machines m and the number of stages s are both fixed 
constants (that is, they are not part of the input), then we can easily derive a 
polynomial approximation scheme. 

We use the standard trick to divide jobs into long and small jobs. Denote 
by LB a lower bound for the cycle time CT. As a lower bound, we may simply 
take the maximum load of all machines or fractional solution value CTfrac- For 
a given e > 0, we define that job J is a long job if p[J] > and J is a small 

job, otherwise. 

Let longi be the number of long jobs on machine i. As LB > — 

longi^^j^, we have at most longi < ^ long jobs on machine i, for a total of at 
most ^ on all machines. 

Our approximation scheme works as follows. The idea is to fix the assignment 
to stages for all long jobs. For each possible assignment of long jobs to stages 
we iteratively solve the fractional relaxation and use the conversion routine to 
an integral assignment from above. The crucial observation is that our core 
routine can also solve the relaxed problem subject to these side constraints. The 
algorithm returns the best solution obtained in one of the iterations. We need at 
most 0(s^ ) iterations, and each iteration runs in time 0{m^s^ + N). Hence, 
as m and s are fixed, we get a polynomial algorithm. 

Denote by Tmax,i the maximum processing time of a job which is only par- 
tially assigned to stage i. Since now only small jobs are splitted in the relaxed 
solution, we conclude for the obtained integral assignment 



S 

LB < CTfrac ^ CT < CTfrac + ^ ^ Tmax,il‘^ 



< CTfrac 

< CTfrac 



i=l 

s 2LBe 
+ 2 '^ 

-|- sLB < CTfraci^ + s)) 



which proves the desired performance guarantee of the approximation scheme. 



2.2 An Algorithm for the Core Problem 

We are now going to present an 0{N+m^s^) time algorithm for the core problem. 
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Steps of the core algorithm: The algorithm consists of four steps. Step i computes 
an assignment sti[J] for every job J. More precisely, as we here allow jobs to be 
splitted over several consecutive stages, sti[J] denotes the stage when job J is 
started, whereas sti^c[J] refers to the stage when job J is completed. 

Step 1: For each job we first choose the first stationary stage at which it can be 
scheduled subject to the visibility constraints, that is, we choose := £[J]. 

However, if a job can be realized at any stationary stage, we choose := s. 

Step 2: Next we iterate over the stationary stages from 1 to s — 1 and do the 
following in the i-th iteration. Let J[i] be the set of jobs such that sti[J] = i. 
First we build an earliest start schedule S'[f] for the jobs in J[i]. Let Ci[J] be 
the completion time of job J with respect to S'[z]. Moreover, let t[i] be the first 
moment in time where all those jobs of J[i] are finished in S'[z] that have definitely 
to be finished in the z-th stationary stage (due to the visibility constraints). In 
other words, 

t[i] := u\&yi{^Ci[J] | J € J[i] and u[J] = z}. 

Next we interrupt each job which remained active after t[i] and shift the remain- 
ing portion to the following stationary stage z -I- 1. Similarly, each job which is 
started at or after t[i] is shifted to stationary stage z -I- 1. This finishes the z-th 
iteration of the loop. The result after the last iteration is st 2 [']- 

Step 3: First we build an earliest start schedule for the stationary stage s. Then 
we iterate over all machines and shift jobs (or portions of them) assigned to the 
stationary stage s backwards (if possible) to avoid idle times in earlier steps. 
(Note that this will only be possible for jobs which can be assigned to every 
stationary stage.) The result is sts)-]. 

Step 4- Denote by C[i] the current makespan for the stationary stage z and 
by Cm)*] the current makespan for station M in the stationary stage z. Denote 
by Ci[J] the completion time of job J in the stationary stage z. As usual, the 
minimum over an empty set is defined to be -l-oo. A station is said to be critical 
for the stationary stage z if the makespan of z is assumed at that station. Let 
Mcrit[i] be the set of critical machines of the stationary stage z. Let Si denote the 
maximal amount by which we can shift jobs from stationary stage s to stationary 
stage 1 until one more machine becomes critical for stationary stage s: 

£i := min {C[s] — Cm[s] \ C[s] > Cm[s] for M G At}. 

Moreover, let £2 denote the maximal amount by which we can shift jobs from 
stationary stage s to stationary stage 1 without violating the right bounds for 
the assignment of any job: 

£2 := min{C[s] — Cs[J] \ stslJ] = s and u[J] = s}. 

For station M and stationary stage z let lastmli] be the last job which is executed 
on machine M within stationary stage i. For a job J on machine M denote by 
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chainf{J), called the forward chain of J, the inclusion-maximal chain of jobs 
J Ji Jk scheduled on machine M after J without any idle time. Let 

last{chainf{J)) = Jk be the last job within this chain. For each critical machine 
we consider the next available idle slot which we can fill. The value £3 denotes 
the minimum over all these slots: 

£3 := min{C[j]- C mIj] \j= st3^c[last{chainf{J))],J= lastM[s], Mg Mcrit[s]}- 

For each critical machine M of stationary stage s our goal is to shift the complete 
forward chain chainf{lastM[s]) by some £. Consider such a chain ch on machine 
M.lich contains a stationary stage i and a job J with CmW = C[i], lastM[i] = J 
and u[J] = i, then shifting is only possible if the beginning of stationary stage 
t -I- 1 is also shifted by £. If, in turn, stationary stage i + 1 has a machine M' 
and a job J' with the property J' = firstM[i + 1] and £[J'] = i -I- 1 then we also 
have to shift the forward chain of J'. Repeating this argument, we collect a set 
of chains chset and a set of stationary stages bset which all have to be shifted. 

In this collection step, we may observe that there is a cyclic dependency, that 
is, there are two chains in chset which end with the same job J. In such a case, 
the algorithm terminates immediately. We call this event termination because of 
cyclic dependency. Define 

£4 := min{C[t] — Ci[J] \ J G chset, sts^dJ] = i and u[J] = i}, 



and 

£5 := min{S'i[J] | 5^3 [J] = i, i G bset, £[J] = i and J ^ chset}, 

where denotes the start time of J with respect to stationary stage i. 

If s G bset, redefine £1 := £i/2 and £2 := £ 2 / 2 . After this potential redefini- 
tion, let £ := min{£i, £ 2 , £3, £4, £5}. 

If £ = 0, the algorithm terminates. Otherwise, we shift from each critical 
machine of stationary stage s the amount of £ to stationary stage 1 and update 
all schedules appropriately. In particular, jobs of a stationary stage in bset which 
do not belong to chset will be shifted backwards if this is feasible and fills idle 
time. We repeat Step 4 until £ = 0. The result, st 4 [-], is the output of the 
algorithm. 

This concludes the description of our algorithm. See Figure 2 for a small 
example. 



3 Correctness Proof 

In Section 2.1, the correctness of the reduction was exhaustively discussed. 
Therefore, it remains to prove that the algorithm for the core problem is correct. 
The correctness proof is based on a couple of algorithmic invariants. 

Invariant 1 Throughout the algorithm, each job is legally assigned to stationary 
stages according to the definition of the core problem. 
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Fig. 2. A small example instance with to = 3 machines and s = 5 stationary 
stages. On top, we show a Gantt-chart of the situation after Step 3. Below, 
we show the situation after one iteration of Step 4. Note that this situation is 
already optimal. 



Invariant 2 A stationary stage i G {1,2,. ..,s — 1} does not finish unless some 
job J has been completed such that u[J] = i. 

For a job J on machine M denote by chainb(J), called the backward chain of 
J, the inclusion-maximal chain of jobs Ji J 2 Jk J scheduled on 

machine M before J without any idle time. (If there is no idle time on machine 
M at all, the algorithm stops with an optimal solution.) Let first{chainb{J)) 
be the first job of the backward chain chainb{J). Note that, by definition, 
first{chainb{J)) is always the first job of some stationary stage. 

Invariant 3 After Step 3, the following holds throughout the rest of the algo- 
rithm: For a job J with st 3 [J] > i[J], we have st^lJf] = i[Jf] where Jf = 
first{chainb{J))- 

For stationary stage i, let last(i) be a job which realizes its makespan and 
satisfies u[last{i)] = i. By Invariant 2, last{i) always exists for 1 < j < s — 1. 
The backward chain of last{i) starts with a job Jf = first(chainb{last(i))). By 
Invariant 3, we have stslJf] = i[Jf]- Hence, the backward chain of lastii) gives 
us a certificate that the difference between the completion time of stationary 
stage i and the start time of stationary stage is as small as possible. 

Certificates of optimality Clearly, the job assignment is optimal if there is a 
machine M without any idle time. A second type of certificate for optimal- 
ity can be derived by combining backward chains from different machines: Let 
c/iQ, c/ii, . . . , chk be chains of jobs where cho = chainb{last{io) for some station- 
ary stage io, i\ = first{chainb{last(io))) — 1 and chi = chainb{last{i\) , ..., 
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= first{chainb{last{ik — 1))) — 1 and chk = chainb{last{ik). If the concatena- 
tion of these chains covers all index steps at least once then the job assignment 
is also optimal. 

Lemma 2. The proposed algorithm finds an optimal fractional job assignment 
for s stages in Ofm^s^ + N) time, where N is the total number of jobs. 

Proof. By Invariant 1, the algorithm always yields a feasible fractional assign- 
ment. We first show that the algorithm terminates. At most to — 1 = |AI| — 1 
times we can choose £ = £i as each time we add at least one critical machine 
to Aicrit[s] and A4crit[s] never decreases. Only once it can happen that £ = £2 
as afterwards we have a certificate of optimality. At most to(s — 1) times we 
can choose £ = £3 directly after another as each time one idle slot vanishes. 
As long as A4crit[s] remains unchanged, £ = £4 can be chosen at most once for 
each stage. Similarly, £ = £5 will occur at most to • s times before A4crit[s] must 
change. Thus, in total we have at most 0{mfs^) executions of Step 4. Steps 1-3 
each require 0{N) time, each execution of Step 4 can be done in 0{sm) time 
which yields the claimed time bound. 

To show optimality at termination, we have to discuss the different possibil- 
ities for £ = 0. Note that we always have £1 > 0. Similarly, by definition £4 > 0 
and £5 > 0. If £3 = 0, then there is a critical machine M without any idle time 
in the current schedule. So the job assignment is clearly optimal in this case. 

It remains the case £2 = 0. Hence, there is a critical machine M with 
M[^astM[s]] = s. Now consider the backward chain of the job J = lastM[s]. 
Let cho = chainb{J), i\ = first{chainb{J) — 1 and chi = chainb{last{ii) . 
Continue in this manner by defining ij = first{chainb{last{ij — 1))) — 1 and 
chj = chainb{last{ij) until the concatenation of these chains covers all stages at 
least once. Note that Invariant 2 guarantees that this process will not stop before 
all stages are covered. Hence, we have found a certificate for optimality. Finally, 
we obviously also have a certificate for optimality if the algorithm terminates 
because of a cyclic dependency. 

4 Number of Subproblems to Be Considered 

Consider a stationary stage. Such a stage has position x with respect to the 
reference point 0 , if the left borders of the workpieces on the assembly line are 
at positions x + k ■ WP for integral k values. Here, WP denotes the workpiece 
pitch, that is, the distance between the left borders of two successive workpieces 
on the assembly line. For a stationary stage with position x, let MT\x] be the set 
of jobs which can be executed at this position of the board by the corresponding 
machine. We say that stationary stage positions a;i,a ;2 are equivalent if the sets 
MT[xi] and MT[x 2 ] are identical. This equivalence relation partitions the in- 
terval [0, WP] into at most 0{N) subintervals Ii, I 2 , . . . , It, t < 2N (namely its 
equivalence classes), which can be determined by a straight-forward sweep- line 
algorithm in linear time. Just note that during the sweeping of the stage position 
X from [0, WP] each job is added to and removed from MT\x] exactly once. If 




160 



Matthias Miiller-Hannemann and Karsten Weihe 



we enumerate over every s-element subset of these intervals as candidates for the 
subroutine which solves the problem for a fixed stepping scheme to optimality, 
we will find the overall optimal solution. This requires at most 0{N^) iterations. 

This bound can be improved substantially, if we exploit that possible forward 
steps are quite granular (recall the discussion from the Introduction): the offset 
by which a workpiece is moved in a forward step must be a multiple of A. 
Each stationary stage position x can therefore be expressed as x = xq + k ■ A 
for an appropriate integer k > 0 and a shift parameter xq with 0 < a;o < ^. 
The shift parameter xq is the offset of the start configuration to the reference 
point 0. Obviously, the number of stationary stages is bounded from above by 
the parameter qi := [lEP/Zl]. 

Hence, for fixed parameter xq we have to choose s elements from at most 
\WP/A'\ positions (instead from 0{N) positions). For a given xq, we say that 
we hit interval Ij = [^j^ Uj] if there is an integer k such that £j < a;o + kA < Uj. 
Now we can also define an equivalence relation for the parameter xq with respect 
to the intervals I\,l 2 , ■ ■ ■ , In for the N jobs. Namely, xq and xq are equivalent 
if the sets of intervals hit by them are identical. Again, we can use a sweep-line 
technique to determine all equivalence classes for the parameter xg- There are 
at most 0{N) different classes for the following reason: if we slide the parameter 
Xq from 0 to Z\ each of the t < N intervals Ij will be inserted to or removed 
from the set of hit intervals at most once. In summary, we have to consider at 
most 0(min{Ns^b A^®}) different selections of stationary stage positions. 

5 Conclusion 

In this paper, we have considered a variant of an assembly-line scheduling prob- 
lem of high practical importance. 

The main idea of our methodology is to identify parameters that not only 
allow efficient algorithms but also turn out to have small values in practice. In 
our specific application, we used a collection of such parameters which enabled 
us to develop an efficient approach with provably near-optimal results. In the 
future, we will try to extend this kind of analysis to other applications form our 
cooperation partners. 
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Abstract. Based on a series of known and new examples, we propose 
the generalized setting of “distance from triviality” measurement as a 
reasonable and prospective way of determining useful structural problem 
parameters in analyzing computationally hard problems. The underly- 
ing idea is to consider tractable special cases of generally hard problems 
and to introduce parameters that measure the distance from these spe- 
cial cases. In this paper we present several case studies of distance from 
triviality parameterizations (concerning Clique, Power Dominating 
Set, Set Cover, and Longest Common Subsequenge) that exhibit 
the versatility of this approach to develop important new views for com- 
putational complexity analysis. 

1 Introduction 

Vertex Cover is one of the NP-complete problems that stood at the cradle 
of parameterized algorithm design and analysis [11]. Given an undirected graph 
with n vertices and a nonnegative integer k, the question is whether we can 
find a set of at most k graph vertices such that each graph edge has at least 
one of its endpoints in this set. The currently best fixed-parameter algorithms 
exactly solve Vertex Cover in 0(1.3^ -I- kn) time [8,22]; that is. Vertex 
Cover is fixed-parameter tractable when parameterized by A:. A different way 
of parameterizing Vertex Cover is to consider the structure of the input graph. 
If the given graph allows for a tree decomposition [26,4] of width w, then it is 
well-known that Vertex Cover can be solved in 0(2™ -n) time [30] independent 
of the size k of the cover set we are searching for. Hence, Vertex Cover is also 
fixed-parameter tractable when parameterized by w. As a rule, most problems 
can be parameterized in various reasonable ways.^ The example Vertex Cover 
exhibits two fundamentally different ways of parameterization — “parameterizing 
by size” (i.e., the size of the vertex cover) and “parameterizing by structure” 
(i.e., the treewidth of the underlying graph). In this paper we propose to take 

* Supported by the Deutsche Forschungsgemeinschaft (DFG), Emmy Noether research 
group PIAF (fixed-parameter algorithms), NI 369/4. 

^ For instance. Fellows [13] discusses how to parameterize the Max Leaf Spanning 
Tree problem in at least five different ways. 
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a broader, generalized view on parameterizing problems by structure, leading 
to a generic framework of new research questions in parameterized complexity 
analysis. 

The leitmotif of parameterized complexity theory [11] is to gain a better un- 
derstanding of problem hardness through a refined complexity analysis that uses 
a two-dimensional view on problems by means of parameterization. A natural 
way to do this is as follows. Consider a problem such as Vertex Cover and find 
out what efficiently solvable special cases there are known. For instance. Vertex 
Cover is trivially solvable on trees. Now, for example, consider the parameter d 
defined as the number of edges that have to be deleted from a graph to transform 
it into a tree. In this sense parameter d measures the “distance from triviality” 
and one may ask whether Vertex Cover is fixed-parameter tractable when 
parameterized by d. In this simple example the answer is clearly “yes” because 
such a graph has treewidth bounded by d -I- 1 [3] and, thus, Vertex Cover 
can be solved using the tree decomposition approach [30]. But in other cases 
this “distance from triviality” approach to parameterization often leads to in- 
teresting new research questions: For instance, in a recent work Hoffmann and 
Okamoto [19] describe a fixed-parameter algorithm for the Traveling Sales- 
man Problem in the two-dimensional Euclidean plane based on the following 
distance from triviality parameterization: Consider a set of n points in the Eu- 
clidean plane. Determine their convex hull. If all points lie on the hull, then 
this gives the shortest tour. Otherwise, Hoffmann and Okamoto show that the 
problem is solvable in 0{k\ ■ k ■ n) time where k denotes the number of points 
inside the convex hull. Thus, the distance from triviality here is the number k 
of inner points. 

In this paper we extend the distance from triviality concept to a broader 
setting and we discuss further examples for the fruitfulness of this new param- 
eterization methodology. We present further recent examples from the litera- 
ture concerning Satisfiability [28] and Graph Coloring [7] that fit into our 
framework. In addition, we provide four new fixed-parameter tractability results 
using this framework for Clique, Power Dominating Set, Set Cover, and 
Longest Common Subsequence. Given all these case studies, we hope to con- 
vince the reader that, in a sense, “parameterizing away from triviality” yields a 
generic framework for an extended parameterized complexity analysis to better 
understand computational (in)tractability. Further aspects of our scenario and 
its prospects for future research are discussed in the concluding section. 

2 Preliminaries and Previous Work 

Preliminaries. Parameterized complexity theory [11] offers a two-dimensional 
framework for studying the computational complexity mostly of NP-hard prob- 
lems. A parameterized language (problem) T is a subset L C E* x E* for some 
finite alphabet E. For (x, k) G L, by convention, the second component denotes 
the parameter. The two dimensions of parameterized complexity analysis are 
constituted by the input size n := \{x,k)\ and the parameter value k (usually 




164 Jiong Guo, Falk Hliffner, and Rolf Niedermeier 



a nonnegative integer) . A parameterized language is fixed-parameter tractable if 
it can be determined in f{k) ■ time whether (x, k) G L, where / is a com- 

putable function only depending on k. Since the parameter k represents some 
aspect(s) of the input or the solution, there usually are many meaningful ways 
to parameterize a problem. An important issue herein is whether a problem is 
fixed-parameter tractable with respect to a chosen parameter or not (i.e., W[l]- 
hard, see [11] for details), and, in case of fixed-parameter tractability, how small 
the usually exponential growth of the function / can be kept. Hence, investigat- 
ing different parameterizations gives insight into what causes the computational 
(in)tractability of a problem and in which qualitative and quantitative sense this 
happens. Refer to [10,14,21] for recent surveys on parameterized complexity. 

Previous Work. The aim of this paper is to stimulate research on the structural 
parameterization “distance from triviality.” Clearly, one of the most sophisti- 
cated examples in this context is the notion of bounded treewidth developed 
by Robertson and Seymour [26]. Without going into details, we remark that 
the basic motivation for considering this concept can be derived from the fact 
that many NP-hard graph problems (such as Vertex Cover) become easy 
(linear-time solvable) on trees. Treewidth then measures how tree-like a graph 
is, and if this parameter is small, then many otherwise hard graph problems can 
be solved efficiently (see [4] for a survey). In this sense treewidth measures the 
distance from the triviality “tree” and problems such as Vertex Cover are 
fixed-parameter tractable with respect to this structural parameter [30] . 

Another prominent problem is Graph Coloring. Leizhen Cai [7] recently 
initiated a study of Graph Goloring which falls into our framework. For 
instance, considering split graphs (where Graph Goloring is solvable in poly- 
nomial time) he showed that Graph Coloring is fixed-parameter tractable 
with respect to parameter k on graphs that originate from split graphs when 
adding or deleting k edges. By way of contrast, it is W[l]-hard when deletion 
of k vertices leads to a split graph. Interestingly, the problem is much harder in 
case of bipartite graphs instead of split graphs: Graph Goloring becomes NP- 
complete for graphs that originate from bipartite graphs by adding three edges 
or if two vertex deletions are needed to make a graph bipartite. In summary, Cai 
states that “this new way of parameterizing problems adds a new dimension to 
the applicability of parameterized complexity theory” [7].^ 

Finally, to emphasize that not only graph problems fall into our framework 
we give an example with Satisfiability. It is easy to observe that a boolean 
formula in conjunctive normal form which has a matching between variables and 
clauses that matches all clauses is always satisfiable. For a formula F, considered 
as a set of m clauses over n variables, define the deficiency as S{F) := m — n. 
The maximum deficiency is 6*{F) := max.pi(ZF d(F'). Szeider shows that the 
satisfiability of a formula F can be decided in 0(2'^ ■ n^) time [28]. Note that 

^ Juedes et al. [20] show that coloring an n-vertex-graph with n — k colors is fixed- 
parameter tractable with respect to k. Clearly, A: = 0 is trivial. This parameteriza- 
tion, however, is not a structural one. 
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a formula F with S*{F) = 0 has a matching as described above. Again, 6*{F) is 
a structural parameter measuring the distance from triviality in our sense. 

In the following sections we provide new case studies for the applicability of 
the distance from triviality concept in various contexts. Clearly, it is conceiv- 
able that several other examples from the literature will fit as examples into 
our concept.^ An important point, however, is that all of the parameterizations 
discussed here have nothing to do with the solution itself (i.e., the value to be 
determined or optimized) . Our parameterizations are structural ones. 



3 Case Study Clique 

The Clique problem is defined as follows: 

Input: A graph G = {V, E) and a nonnegative integer s. 

Question: Does G contain a clique, i.e., a complete subgraph, of size si 

Clique is W[l]-complete with respect to the natural parameter s [11]. It 
is also hard to approximate to an arbitrary constant factor. Here we exhibit 
fixed-parameter tractability with respect to the distance from a trivial case. 

Our trivial case is the class of cluster graphs: graphs which are a disjoint 
union of cliques. Clique can be trivially solved in linear time on such graphs. We 
examine Clique on graphs which are “almost” cluster graphs, namely, on graphs 
which are cluster graphs with k edges added. From a general result on graph 
modification problems by Leizhen Cai [6] it follows that finding the added k 
edges is fixed-parameter tractable with respect to k. Improved algorithms for 
this problem (which is known as Cluster Deletion) were given by Gramm et 
al. [15,16], providing an algorithm running in 0(1.53^ -I- [C]^) time. 

It remains to show how to solve Clique for the “almost cluster graph” G 
after identifying the k added edges and the corresponding cluster graph G' . If 
the largest clique in G is not one which is already contained in G", then each 
of its vertices must have gained in degree by at least one compared to G' . This 
means it can only be formed by a subset of the up to 2k vertices “touched” 
by the added edges. Hence, we solve Clique for the subgraph of G induced by 
the up to 2k vertices which are endpoints of the added edges. This step can be 
done for example by using Robson’s algorithm for Independent Set [27] on 
the complement graph in 0(1.22^^) = 0(1.49^) time, which is dominated by the 
above time bound for the Cluster Deletion subproblem. The largest clique 
for G is simply the larger of the clique found this way and the largest clique 
in G' . We obtain the following theorem: 

Theorem 1. Clique for a graph G = {V, E) which is a cluster graph with k edges 
added can he solved in 0(1.53^ -I- [C]^) time. 

® For instance, Nishimura et al. [23] developed algorithms for recognizing general 
classes of graphs generated by a base graph class by adding at most k vertices. Their 
fixed-parameter tractability studies are closely related to our methodology. 
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4 Case Study Power Dominating Set 

Domination in graphs is among the most important problems in combinatorial 
optimization. We consider here the Power Dominating Set problem [18], 
which is motivated from applications in electric networks. The task is to place 
monitoring devices (so-called PMUs) at vertices such that all edges and vertices 
are observed. The rules for observation are: 

1. A PMU in a vertex v observes v and all incident edges and neighbors of v. 

2. Any vertex that is incident to an observed edge is observed. 

3. Any edge joining two observed vertices is observed. 

4. If a vertex is incident to a total of t > 1 edges and if z — 1 of these edges are 
observed, then all z edges are observed. This rule is reminiscent of Kirchhoff’s 
current law from electrical engineering. 

We can now formulate the Power Dominating Set problem: 

Input: A graph G = (V,E) and a nonnegative integer k. 

Question: Does G have a power dominating set of size at most k, 
that is, a subset M C V of vertices such that by placing a PMU in 
every v G M, all vertices in V are observed? 

Power Dominating Set is NP-complete [18]. There is an algorithm known 
which solves Power Dominating Set in linear time on trees [18]. Since we use 
this algorithm as a building block for our result, we briefly sketch how it proceeds. 
This algorithm works bottom-up from the leaves and places a PMU in every 
vertex which has at least two unobserved children. Then it updates observation 
according to the four observation rules and prunes completely observed subtrees, 
since they no longer affect observability of other vertices. 

Our goal is now to find an efficient algorithm for input graphs that are 
“nearly” trees. More precisely, we aim for a fixed-parameter algorithm for graphs 
which are trees with k edges added. 

Note that a tree with k edges added has treewidth bounded by fc-|-l [3]. While 
Dominating Set is fixed-parameter tractable with respect to the parameter 
treewidth [2], no such result is currently known for Power Dominating Set. 
This motivates our subsequent result. 

As a first step we present a simple algorithm with quadratic running time 
for the case of one single added edge. 

Leuiuia 1. Power Dominating Set for a graph G = (V,E) with n := \V\ 
which is a tree with one edge added can he solved in O(zz^) time. 

Proof. Graph G contains exactly one cycle and a collection of trees Ti touching 
the cycle at their roots. 

We use the above mentioned linear time algorithm to find an optimal solution 
for each Ti. When it reaches the root ri, several cases are possible: 

~ The root ri needs to be in M, and we can remove it. This breaks the cycle, 
and we can solve the remaining instance in linear time. 
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— The root is not in M , but already observed. Then all children of in Ti 
except for at most one are observed, or we would need to take ri into M . 
Then, we can remove Tj except for ri and except for the unobserved child, 
if it exists. The root ri remains as an observed degree-2 or degree-3 vertex 
on the cycle. 

— The root still needs to be observed. This is only possible if it has exactly 
one child in Ti which is unobserved since otherwise either would be in M, 
or be observed. As in the previous case, we keep and the unobserved child, 
and the rest of Tj can again be removed. 

If after these data reductions two observed vertices are adjacent on the cycle, 
their connecting edge becomes observed, and we can break the cycle. Otherwise, 
we call it a reduced cycle. 

At least one vertex on the reduced cycle has to be added to M. We simply 
try each vertex. After each choice, the rest of the cycle decomposes into a tree 
after pruning observed edges, and can be handled in linear time. From all possible 
initial choices, we keep the one leading to a minimal M, which then is an optimal 
choice for the initial problem. Since there are 0{n) vertices and edges on the 
cycle, this takes O(n^) time. □ 

We note without proof that Lemma 1 can be improved to linear time by ex- 
amining a fixed-size segment of the cycle. In each possible case we can determine 
at least one vertex in the segment which has to be taken into M. 

Lemma 1 is applicable whenever each vertex is part of at most one cycle. We 
now generalize this and Haynes et al.’s [18] result. 

Theorem 2. Power Dominating Set for a graph which is a tree with k edges 
added is fixed-parameter tractable with respect to k. 

Proof. We first treat all trees which are attached in single points to cycles as 
in the proof of Lemma 1. What remains are degree-2 vertices, degree-3 vertices 
with a degree-1 neighbor, and other vertices of degree 3 or greater, the joints. 
For a vertex v, let degw denote its degree, that is, the number of its adjacent 
vertices. We branch into several cases for each joint: 

— The joint v is in M . We can prune it and its incident edges. 

— The joint v is not in M. Note that the only effect v can still have is that 
a neighbor of v becomes observed from application of observation rule 4 
( “Kirchhoff ’s current law”) applied to v. We branch further into degv • 
(degw — 1) cases for each pair {w\,W 2 ) of neighbors of v with w\ W 2 . 
In each branch, we omit the edges between v and all neighbors of v ex- 
cept wi and W 2 . Clearly any solution of such an instance provides a solution 
for the unmodified instance. Furthermore, it is not too hard to show that if 
the unmodified instance has a solution of size s, then on at least one branch 
we will also find a solution of size s. To see this, consider a solution M for the 
unmodified problem. Vertex v is observed; this can only be either because 
a neighbor wi of v was put into M, or because there is a neighbor wi of v 
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such that the edge {wi,u} became observed from observation rule 4. Fur- 
thermore, as mentioned, there can be at most one vertex W 2 which becomes 
observed by observation rule 4 applied to v. Then M is also a valid solution 
for the branch corresponding to the pair (wi,W 2 )- 

In each of the less than (deg v)"^ branches we can eliminate the joint. If we branch 
for all joints in parallel, we end up with an instance where every connected com- 
ponent is a tree or a cycle with attached degree-1 vertices, which can be solved 
in linear time. The number of cases to distinguish is Ow is joint Since 

there are at most 2k joints, each of maximum degree k, the total running time is 
roughly bounded by 0{n ■ confirming fixed-parameter tractability. □ 

5 Case Study Tree-like Weighted Set Cover 

Set Cover is one of the most prominent NP-complete problems. Given a base 
set S = {si, S 2 , . . . , Sn} and a collection C of subsets of S', C = {ci, C 2 , . . . , Cm}, 
Ci C S for 1 < i < m, and Ui<i<m G = S, the task is to find a subset C of C with 
minimal cardinality which covers all elements in S, i.e., UceC' c = S. Assigning 
weights to the subsets and minimizing the total weight of the collection C 
instead of its cardinality, one naturally obtains the Weighted Set Cover 
problem. We call C the minimum set cover of S resp. the minimum weight 
set cover. We define the occurrence of an element s G S in (7 as the number 
of subsets in C which contain s. Set Cover remains NP-complete even if the 
occurrence of each element is bounded by 2 [24]. 

Definition 1 (Tree-like subset collection). 

Given a base set S = {si, S 2 , . . . , s„} and a collection C of subsets of S, C = 
{ci, C 2 , . . . , Cm}. We say that C is a tree-like subset collection of S if we can 
organize the subsets in C in an unrooted tree T such that every subset one-to- 
one corresponds to a node of T and, for each element Sj G S , 1 < j < n, all 
nodes in T corresponding to the subsets containing Sj induce a subtree of T . 

We call T the underlying subset tree and the property of T that, for each s G 
S, the nodes containing s induce a subtree of T, is called the ^^consistency prop- 
erty” of T. Observe that the consistency property is also of central importance in 
Robertson and Seymour’s famous notion of tree decompositions of graphs [26,4]. 
By results of Tarjan and Yannakakis [29], we can test whether a subset collec- 
tion is a tree-like subset collection and, if so, we can construct a subset tree for 
it in linear time. Therefore, in the following we always assume that the subset 
collection is given in form of a subset tree. For convenience, we denote the nodes 
of the subset tree by their corresponding subsets. 

Here, we consider the Tree-like Weighted Set Cover (TWSC) problem 
with bounded occurrence which is defined as follows: 

Tree-like Weighted Set Cover with bounded ocgurrence: 

Input: Given a base set S = {si , S 2 , . . . , s„| and a tree-like collection C 
of subsets of S', C = {ci, C 2 , . . . , Cm}- Each element of S can be in at 
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most d subsets for a fixed d > 1. Each subset in C has a positive real 
weight w{ci) > 0 for 1 < z < TO. The weight of a subset collection is the 
sum of the weights of all subsets in it. 

Task: Find C' C C with minimum weight which covers all elements 
in S, i.e., UcgC' c = S. 

TWSC with bounded occurrence d > 3 is NP-complete even if the underlying 
subset tree is a star [17]. However, it can be solved in 0{rn?n) time if the under- 
lying subset tree is a path [17]. Now our goal is, based on the “trivial” path-like 
case, to give a fixed-parameter algorithm for TWSC with bounded occurrence 
where the number of leaves of the subset tree functions as the distance parameter 
from the path-like case. 

The fixed-parameter algorithm. Given a subset tree T with k leaves, the following 
observations are easy to prove. 

Observation 1. The maximum degree of the nodes of T is upperbounded by k. 
Observation 2. The number of tree nodes with more than 2 neighbors is upper- 
bounded by k. 

Observation 3. For each c € C, the number of subsets c' € C which share some 
elements with c, i.e., cDc' yf 0, is upperbounded hy d-k. These subsets and c 
induce a subtree of T. 

For a subset c € C, let deg c denote the degree of its corresponding node in T. 
The basic idea of the algorithm is to divide the tree-like instance into several 
“independent” path-like instances and to solve these instances separately by 
using the 0{m^n)-time algorithm. Instances are independent if they share no 
common elements with each other. For each tree node c with degc > 3, we 
construct a set He := {c' | c C c' yf 0}. Note that \Hc\ < d ■ k hy Observation 3. 
To cover all elements of c, we have to add some subsets from He into the set 
cover. We delete the subsets added into the set cover and all their adjacent edges 
from T. Furthermore, we delete elements of S which are already covered from 
all remaining subsets. Observe that if c is in the set cover, we retain several 
subtrees of T after deleting c; otherwise, c is now an empty subset. By deleting 
all empty subsets, the subset tree T is divided into several subtrees. Due to the 
consistency property of T, all these resulting subtrees are independent. Since the 
possible combinations of the subsets from He which cover all elements of c are 
upperbounded by 2*^^, we can have up to 2*^^ new instances by dividing T at c. 
By processing all nodes c with degc > 3 in the same way, there are ) new 

instances each of which consists of Ofm) independent path-like instances. Then 
the minimum set cover for one of these new instances is the union of the optimal 
solutions for the path-like instances with the subsets already added into the set 
cover while processing the nodes with degree at least 3. In summary, we get the 
following theorem: 

Theorem 3. TWSC with occurrence bounded by d can be solved in 0{2’^^ -mfu) 
time, where k denotes the number of the leaves of the subset tree. 
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Note that while results from [17] only cover cases with bounded subset size, 
we impose no such restriction here. However, here we need the bounded occur- 
rence restriction. 



6 Case Study Longest Common Subsequence 

In this section we deal with the Longest Common Subsequence problem, an 
important problem in theoretical computer science and computational biology. 

Longest Common Subsequence (LCS): 

Input: Given a set of k strings Xi,X 2 , . . . , Xk over an alphabet X and 

a positive integer m. 

Question: Is there a string X € X* of length at least m that is a 

subsequence of Xi for i = 1 , . . . , k? 

LCS is NP-complete even if IN’] = 2. Concerning the parameterized complex- 
ity of LCS with unbounded alphabet size, Bodlaender et al. [5] showed that LCS 
is W[t]-hard for t > 1 with k as parameter, W[2]-hard with m as parameter, 
and W[l]-hard with k and m as parameters. With a fixed alphabet size, LCS is 
trivially fixed-parameter tractable with m as parameter, but W[l]-hard with k 
as parameter [25]. 

Let n denote the maximum length of the input strings and sf denote the 
number of occurrences of letter a € X in Xi. We consider a new parameterization 
of LCS with k and s := maxagi; maxi<i<fc sf as parameters. To begin with, we 
show that the case s = 1 of this parameterization, where every letter occurs in 
each string only once, is solvable in polynomial time. 

Without loss of generality, we assume that all input strings have the same 
length n, X = {1,2,..., n}, and Xi = 1 2 3 • • • n. Then the strings X 2 , X 3 , . . . ,Xk 
are permutations of Xi . We construct a directed graph G with nxk vertices; each 
vertex represents a position in a string. A directed edge is added from Vij to Wi+i,; 
for 1 < i < /c iff the letters in position j of Xi and in position I of W+i are the 
same. It is easy to observe that a longest common subsequence one-to-one corre- 
sponds to a maximum set of directed paths in G which do not cross each other. 
Two paths P and P' cross each other if there is an edge {vij,Vi+i^i) in P and an 
edge {viji , Vi-i-i^ii ) in P' with j < j' and I > V . In order to find a maximum set of 
the noncrossing paths in G, we construct a “path-compatibility” graph PG{G) 
from G: For each directed path in G, we create a vertex Pa in PG{G) where a 
is the position in X\ where the path starts. We add a directed edge {Pa,Pt) 
from Pa to Pb if Pa does not cross Pb and a < b. Thus, PC{G) is an acyclic 
directed graph and a maximum set of noncrossing paths of G one-to-one corre- 
sponds to a longest path in PC{G). By using depth-first search for each vertex 
in PC{G) with in-degree of zero, we can easily find such a longest path. 

Concerning the running time to solve LCS on such an instance, we note that 
graphs G and PG{G) can be constructed in 0{k ■ n) and 0{k ■ n^) time, respec- 
tively. Finding a longest path in PG{G) can be done in 0{n) time. Summarizing, 
the running time for solving LCS on these instances is 0{k ■ v?). 
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The fixed-parameter algorithm. Given strings X\, X 2 , ■ ■ ■ , we construct a 
graph G with n ■ k vertices as described above. However, a vertex vig with 1 < 
i < k has to be connected to all vertices Vi+i^h, I < h < n, where Xi+i has 
the same letter in position h as Xt in position 1. Graph G can be constructed 
in 0{k-vfi) time. Then we construct the path-compatibility graph PG{G) from G. 
Since the number of paths in G can be up to rii<i<fe 

we have 0{n ■ s^) vertices in PG{G). Each vertex Pj^,j. 2 ,...,jk represents a path 
in G, and the indices ji, j’ 2 , ■ ■ ■ ,jk denote the positions in the k strings where 
this path passes. Furthermore, a directed edge from Pji,j 2 ,...,jk - 

is added into PC{G) if the corresponding paths do not cross each other, are 
vertex-disjoint, and ji < j{. It is easy to verify that by finding a longest path in 
this acyclic directed graph, we get a longest common subsequence of the input 
instance. The construction of the edges in PG{G) takes 0{k) time per vertex 
pair, and we obtain an algorithm running in 0{k ■ (n • s^)^) = 0(2^^*°®^ • k • vfi) 
time. This yields the following result. 

Theorem 4. Longest Common Subsequence can he solved in . 

k ■ n^) time, where s denotes the maximum occurrence number of a letter in an 
input string. 

7 Concluding Discussion 

The art of parameterizing problems is of key importance to better understand 
and cope with computational intractability. In this work we proposed a natural 
way of parameterizing problems — the parameter measures some distance from 
triviality. The approach consists of two fundamental steps. Assume that we study 
a hard problem X . 

1. Determine efficiently solvable special cases of X (e.g., in case of graph prob- 
lems, the restriction to special graph classes) — the triviality. 

2. Identify useful distance measures from the triviality (e.g., in case of trees 
and graphs the treewidth of a graph) — the (structural) parameter. 

As to step 2, observe that various distance measures are possible such as edge 
deletions or vertex deletions in case of graphs. It is important, however, that 
we can efficiently determine the distance of a given input instance to the chosen 
triviality with respect to the parameter considered. For instance, it is “easy” to 
determine the distance of a graph to being acyclic with respect to edge dele- 
tion (this leads to the polynomial-time solvable Feedback Edge Set problem) 
whereas it is hard to determine the distance of a graph to being bipartite with 
respect to edge deletion (this leads to the NP-hard Graph Bipartization prob- 
lem). However, in case we are explicitly given the “operations” that transform 
the given object into a trivial one (e.g., the edges to be deleted to make a graph 
bipartite), the question for the parameterized complexity of the underlying prob- 
lem with respect to the distance parameter might still be of interest. In the new 
case studies presented in this paper the distance measurement for Longest 
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Common Subsequence was easy to determine whereas in the Clique case the 
measurement led to an NP-hard but fixed-parameter tractable problem. 

We do not claim that all the parameterizations we considered generally lead 
to small parameter values. This was not the central point, which, by way of 
contrast, was to extend the range of feasible special cases of otherwise compu- 
tationally hard problems. As pointed out by an anonymous referee, it would be 
interesting to study more drastic distance measures such as considering relative 
distances — for instance, what if 1% of all edges may be edited in a graph. 

It is worth emphasizing that the tractable trivial case may refer to polynomial- 
time solvability as well as fixed-parameter tractability.^ An example for the 
latter case is Dominating Set on planar graphs which is fixed-parameter 
tractable [1,2]. These results were extended to graphs of bounded genus [12,9], 
genus here measuring the distance from the “trivial case” (because settled) pla- 
nar graphs. Moreover, the proposed framework might even be of interest in 
the approximation algorithms context where triviality then might mean good 
polynomial-time approximability (e.g., constant factor or approximation scheme). 
In summary, we strongly believe that distance from triviality parameterization 
leads to a wide range of prospective research opportunities. 
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Abstract. Haplotyping via perfect phylogeny is a method for retrieving haplo- 
types from genotypes. Fast algorithms are known for computing perfect phyto- 
genies from complete and error-free input instances — these instances can be or- 
ganized as a genotype matrix whose rows are the genotypes and whose columns 
are the single nucleotide polymorphisms under consideration. Unfortunately, in 
the more realistic setting of missing entries in the genotype matrix, even re- 
stricted forms of the perfect phylogeny haplotyping problem become NP-hard. 
We show that haplotyping via perfect phylogeny with missing data becomes 
computationally tractable when imposing additional biologically motivated con- 
straints. Firstly, we focus on asking for perfect phylogenies that are paths, which 
is motivated by the discovery that yin-yang haplotypes span large parts of the 
human genome. A yin-yang haplotype implies that every corresponding perfect 
phylogeny has to be a path. Secondly, we assume that the number of missing en- 
tries in every column of the input genotype matrix is bounded. We show that the 
perfect path phylogeny haplotyping problem is fixed-parameter tractable when 
we consider the maximum number of missing entries per column of the geno- 
type matrix as parameter. The restrictions we impose are met by a majority of the 
problem instances encountered in publicly available human genome data. 



1 Introduction 

1.1 Haplotype Inference from Genotypes Via Perfect Phylogeny 

Single nucleotide polymorphisms (SNPs) are differences in a single base, across the 
population, within an otherwise conserved genomic sequence. The sequence of SNP 
states in contiguous SNP positions along a chromosomal region is called a haplotype. 
The knowledge of the haplotypes in the human genome is of particular importance since 
they are believed to be often linked to medical conditions. However, current technolo- 
gies suitable for large-scale SNP detection in the human genome — which contains two 
versions of each chromosome — do not obtain the haplotypes but only the genotype in- 
formation: The genotype specifies, for every SNP position, the two states at this site in 
the two chromosomes. The genotype contains information only on the combination of 
SNP states at a given site, but it does not tell us which of the states belongs to which 
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chromosome. It is an important goal to develop efficient methods for inferring haplo- 
types from genotypes. 

Haplotyping via perfect phylogeny is a method for haplotype inference where it is 
assumed that the (unknown) haplotypes underlying the (observed) genotype data can 
he arranged in a genetic tree in which each haplotype results from an ancestor haplo- 
type via mutations. The perfect phylogeny approach is popular due to its applicability 
to real haplotype inference problems and its theoretical elegance. It was introduced 
by Gusfield [6] and received considerable attention which resulted, among others, in 
quadratic-time algorithms for the case of complete and error- free input data [1, 2]. In 
the special case where perfect path phytogenies are sought, even a linear time algorithm 
is known [5]. 

The main hurdle for current haplotype inference methods is missing input data. 
Real genotype data usually contain a small fraction of missing data caused by technical 
problems in the process of genotype detection. In the presence of missing data, haplo- 
typing via perfect phylogeny is NP-hard [8]. This is even true for the restricted case of 
path phytogenies [5]. In an effort to solve the problem efficiently for restricted cases, 
Halperin and Karp [7] show that perfect phylogeny haplotyping with missing data is 
tractable if the input satisfies the ‘rich-data hypothesis’. This hypothesis requires, intu- 
itively, that the data contains enough information to ‘locally’ infer the haplotypes at any 
two SNP sites. In this paper, we take a different approach which is independent of the 
rich-data hypothesis. We show that perfect phylogeny haplotyping is fixed-parameter 
tractable when restricted to path phytogenies and when taking the maximum number of 
missing entries at a particular SNP site as parameter. Notably, experiments on publicly 
available genotype data show a significant fraction of the data that allows perfect path 
phytogenies but fails to satisfy the rich data hypothesis. 

1.2 Computational Problems 

When stripped of the biological context, the haplotype inference problem is a purely 
combinatorial problem, which we describe in the following. In the combinatorial setting 
a haplotype is a binary string. Each position of the string corresponds to a SNP site. 
When we observe a certain base at the SNP site, the string contains a 0-entry at the 
corresponding position; if we observe a certain other base, the string contains a I -entry. 
In rare situations one may observe three or even four different bases at a specific SNP 
site, but these cases are not modeled. A haplotype matrix is a binary matrix whose rows 
are haplotypes. 

A genotype is a string over the alphabet {0, 1,2}. A 2-entry corresponds to a het- 
erozygous SNP site, meaning that the haplotypes underlying the genotype do not agree 
at that site. The genotype resulting from two haplotypes has a 0-entry or a 1 -entry at 
all positions where both haplotypes agree on a 0-entry or 1 -entry, respectively. It has a 
2-entry at all positions where they disagree. We say that a genotype matrix A admits a 
perfect phylogeny if there exists a rooted tree T , called perfect phylogeny, such that: 

1 . Each column of A labels exactly one edge of T . 

2. Every edge of T is labeled by at least one column of A. 

3. Eor each row r of A there are two nodes in T (possibly identical) labeled / and r". 

The labels / and r" are called haplotype labels. 
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Fig. 1. Example of a genotype matrix A and a corresponding perfect phylogeny T. The 
edges of T correspond to the columns of A. Each genotype row r in A results from 
two haplotypes r' and r" , which label vertices of the tree. The paths induced by the 
first row a of A are shown in bold: The path corresponding to the 1 -entries in row a 
leads from the root to an inner vertex of the tree (solid), the path corresponding to 
the 2-entries is rooted at this inner vertex (dashed). The latter path connects the two 
haplotype vertices a' and a" . 



4. For every row r of A the set of columns with value 2 in this row forms a path p inT 
between / and The set of columns with value 1 in this row forms a path from 
T’s root to the top-most node on the path p. 

An example is depicted in Figure 1. A detailed biological justification for considering 
perfect phytogenies and for requiring the above properties for the tree T can be found 
in [5,6]. 

Perfect phytogenies with the above properties are called directed since the ancestral 
state of every SNP site is assumed to be 0 or, equivalently, the root corresponds to the 
all-0 haplotype. In the undirected case the ancestral state of every site can be arbitrary 
(0 or 1). 

The two versions of the haplotype inference problem via perfect phytogenies men- 
tioned in the previous section are: 

Problem 1.1 (Perfect Phylogeny Haplotyping Problem, {0,1, 2}-PPH). 

Input: A genotype {0, l,2}-matrix A. 

Question: Does A admit a perfect phylogeny? 

Problem 1.2 (Perfect Path Phylogeny Plaplotyping Problem, {0, 1, 2}-PPPH). 

Input: A genotype {0, l,2}-matrix A. 

Question: Does A admit a perfect phylogeny that is a path? 

It is known that these problems can be solved efficiently, but become computa- 
tionally intractable if missing data is introduced [5]. Missing entries are denoted by 
?-entries in the input matrix. Given such a genotype matrix with ?-entries, we say that it 
is pp-realizable (ppp-realizable) if we can replace the question marks with values from 
{0,1,2} such that the resulting genotype matrix admits a perfect (path) phylogeny. This 
leads to the problem {0, 1,2, ?}-PPPH, whose fixed-parameter tractability is the focus 
of this paper: 
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Problem 1.3 (Perfect Path Phytogeny Haplotyping with Missing Entries, 

{0, 1, 2, ?}-PPPH). 

Input: A genotype {0, 1,2, ?}-matrix A. 

Question: Is A ppp-realizable? 

1.3 Biological Relevance of Our Fixed-Parameter Tractability Result 

The general perfect phylogeny haplotyping problem with missing entries is NP-com- 
plete. In order to solve it, one is thus forced to incorporate additional, biologically mo- 
tivated constraints on the problem. To this end, firstly, we focus on path phylogenies. 
Secondly, we assume that for each SNP site only a small fraction of genotypes lack in- 
formation about the state at this site. Thirdly, we focus on directed perfect phylogenies. 

The first assumption is motivated by the recent discovery [10] that about 75 percent 
of the human genome can be covered by yin-yang haplotypes, i.e., long-region pairs of 
haplotypes that are heterozygous at every SNP site. In the model of perfect phylogeny 
haplotyping, the presence of a yin-yang haplotype pair implies that a perfect phylogeny 
of the haplotypes necessarily has to he a path phylogeny. These hndings suggest that 
an efficient algorithm dealing with path phylogenies can already be applied to a large 
portion of the genome, leaving the remaining parts for heuristic or more time-intensive 
postprocessing. 

The second assumption is motivated by the expectation that missing data entries 
should occur only rarely and randomly and should thus not amass at a specific SNP site. 
Furthermore, a SNP site for which most entries are missing is useless for classification 
purposes: filling up the numerous missing entries is pure guesswork. 

The third assumption is a reasonable one to make since, due to the sparse distribu- 
tion of missing entries, we can usually identify the labeling of one node that necessarily 
has to occur in a perfect phylogeny; this can be done using a technique described by 
Eskin et al. [2] (see there for details). Since within each column of the input matrix we 
are free to exchange 0 and 1 labels, we can choose the labels such that this particular 
node carries a 0-entry at every position, which makes the perfect phylogeny directed. 

We have evaluated all three of our assumptions by considering publicly available 
genotype data provided by Gabriel et al. [3], available from http : / / www . broad . 
mit . edu/mpg/hapmap/hapstruc . html. The data consist of sets for four pop- 
ulations. Table 1 gives an overview on the distribution of ?-entries in the data. We see 
that the maximum number of question marks is approximately one fifth of the total 
column size but also that the average number of question marks per column is much 
smaller. We can also see from Table 1 that in all except two of the 248 investigated 
datasets we could identify one of the haplotypes, using the reduction given by Eskin 
et al. [2], thereby making the datasets suitable for our algorithm, which searches for 
directed perfect path phylogenies. 

Table 2 treats the assumption of hnding perfect phylogenies that are path phyloge- 
nies. For a given window length /, we computed for every set of / consecutive columns 
in the input genotype matrix whether these columns give rise to a perfect phylogeny 
and whether they give rise to a perfect path phylogeny. For our statistic we omitted 
rows within a window that contain missing entries. It turns out that, on average, approx- 
imately two third of the encountered perfect phylogenies are in fact path phylogenies. 
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Population 


A 


B 


c 


D 


Number of individuals (rows per genotype matrix) 


93 


50 


42 


96 


Number of genotype matrices 


62 


62 


62 


62 


Average maximum number of ?’s per column 


21.7 


10.5 


8.3 


21.3 


Average number of ?’s per column 


6.2 


2.7 


1.6 


6.8 


Percentage of directed genotype matrices 


97% 


100% 


100% 


100% 



Table 1. Statistics on the distribution of ?-entries in the human genotype data provided 
by Gabriel et al. [3]. For data from four populations, the table first lists the number of 
individuals per population whose genotypes were determined. For each individual, the 
genotypes were determined in 62 different regions of the genome. The table next lists, 
averaged over the genotype matrices for each population, the maximum and the average 
number of ?-entries per column. Finally, it lists the percentage of genotype matrices that 
can be directed. 



Population 


Window Size 


Genotype Matrices 


PP’s 


PPP’s 


RDpp 


RDppp 




5 


3029 


51% 


40% 


32% 


29% 


A 


8 


2843 


29% 


19% 


9% 


7% 




10 


2721 


20% 


13% 


3% 


1% 




5 


2900 


35% 


25% 


26% 


21% 


B 


8 


2720 


14% 


9% 


7% 


4% 




10 


2600 


8% 


5% 


3% 


3% 




5 


3001 


59% 


51% 


36% 


30% 


C 


8 


2821 


37% 


28% 


15% 


16% 




10 


2701 


27% 


20% 


9% 


9% 




5 


2819 


36% 


26% 


43% 


39% 


D 


8 


2641 


14% 


8% 


19% 


16% 




10 


2525 


8% 


5% 


7% 


6% 



Table 2. Statistics on the frequency of perfect path phylogenies in genotype data pro- 
vided by Gabriel et al. [3]. For each population we slid a window of varying length over 
the genotype matrices, resulting in the indicated number of genotype matrices that have 
as many columns as the window size. For these ‘narrow’ genotype matrices we list the 
percentage that admits a perfect phylogeny (PP) and a perfect path phylogeny (PPP). 
We also indicate what percentage of those genotype matrices admitting a perfect phy- 
logeny (RDpp) or perfect path phylogeny (RDppp), respectively, also satisfy the rich-data 
hypothesis. 



Notably, less than one third of those genotype matrices admitting a perfect (path) phy- 
logeny satisfy the rich-data hypothesis and, thus, meet the preconditions of the linear- 
time algorithm given by Halperin and Karp [7]. 



1.4 Our Contribution 

In the present paper we show that the haplotype inference problem via perfect phy- 
logeny becomes tractable if we impose the restrictions discussed before: restricting the 
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perfect phylogenies to paths, restricting the number of missing entries per column to 
a small number, and focusing on directed perfect phylogenies. Due to lack of space, 
proofs are only given in the full version of this paper. 

Theorem 1.4. {0, 1,2, ?}-PPPH is fixed-parameter tractable, where the parameter is 
the maximum number of question marks per column. 

Recall that {0, 1,2, ?}-PPPH itself is NP-complete (even if we furthermore require 
that no 1 -entries are present in the genotype matrix) [5]. The loophole, which is ex- 
ploited in Theorem 1.4, is that the genotype matrices used in the completeness proof 
are unrealistic insofar as they contain huge amounts of missing data — much more than 
any real dataset contains. 

The algorithm used in the proof of Theorem 1 .4 is composed of a powerful prepro- 
cessing and a subsequent exponential-time dynamic programming strategy. The prepro- 
cessing relies on graph-theoretic arguments and the dynamic programming makes use 
of concepts from order theory, see for example [9] for an introduction. 

Different authors, see for example [6, 4], have shown that there are numerous con- 
nections between perfect phytogeny haplotyping problems and more traditional graph 
realization problems. In [5] it is shown that {0,2, ?}-PPPH is equivalent to a special 
version of the interval hypergraph sandwich problem, which is defined as follows: A 
hypergraph is a pair H = (V,E) consisting of a vertex set V and a set E of subsets of V. 
The elements of E are called hyperedges. For a hypergraph sandwich problem we are as 
input given two hypergraphs = (V,E^) andH^ = (V,E^) such that £' = {e\, . . . ,e}„} 
and E^ = {e\,. . . ,el,} with ej C for all i G { 1 , . . . , m}. The goal is to find a hyper- 
graph H = (V,E) that is ‘sandwiched’ between and that is, E = {e\,. . . ,em\ 
with e\ C a C ej for all 1 G { 1 , . . . , m}. 



Problem 1.5 ( Interval Hypergraph Sandwich Problem ). 

Input: Two hypergraphs = (V,£^), = {V,E^). 

Question: Is there a hypergraph// = {V,E) sandwiched between//^ andH^ and a linear 
ordering of V such that each hyperedge e G £ is an interval? 

For the special case of the intersecting interval hypergraph sandwich problem all hy- 
peredges in E are required to share a common vertex. Even this restricted problem is 
NP-complete since it is shown in [5] to be equivalent to {0,2, ?}-PPPH. Therefore, our 
fixed-parameter tractability result for {0, 1 , 2, ?}-PPPH allows us to state the following: 

Theorem 1.6. The intersecting interval hypergraph sandwich problem is fixed-parame- 
ter tractable, where the parameter is the maximum, taken over all vertices v, of edge 
pairs (e ^ , with v G e^ — e^. 

2 Fixed-Parameter Algorithm for {0, 1, 2, ?}-PPPH 

In this section we present the fixed-parameter algorithm that proves Theorem 1 .4. The 
input of the algorithm is a {0, 1,2, ?}-matrix A of dimension nxmin which there are at 
most k question marks per column. 
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2.1 Structure of the Algorithm 

The algorithm proceeds in two phases. In the hrst phase, which we call the preprocess- 
ing phase, the input is hrst simplihed by collapsing multiple columns to one column if 
we can safely do so. In the second phase dynamic programming is used to compute a 
completion of the question mark entries such that a perfect path phylogeny results. 

The core idea of the preprocessing phase is the following: Suppose several columns 
become identical by resolving some ?-entries and we replace these columns with that 
one ‘consensus’ column. Clearly, if we can hnd a perfect path phylogeny for this new 
matrix, we can also hnd one for the original one. However, the reverse implication is 
also true if the number of columns that formed the consensus was large enough. 

Gusheld [6] introduced the leaf count, a ‘weight’ of a column that depends on the 
number of 1 ’s and 2’s in this column. The main beneht of the preprocessing is that there 
will be a uniform bound, depending only on k, on the number of columns that have the 
same leaf count. The edges, corresponding to columns, on a path from a leaf to the root 
in a perfect phylogeny have to carry increasing leaf counts. The core idea of the dynamic 
programming used in the second phase is to process the columns in order of ascending 
leaf counts, building a perfect path phylogeny from bottom to top. For each column, we 
consider all possible ways to resolve the ?-entries in it. For each possible resolution, we 
check whether a perfect path phylogeny exists that accommodates the resolution and, if 
so, construct it. The crucial observation, which makes dynamic programming feasible, 
is the fact that the desired perfect path phylogeny for columns up to a certain leaf count 
can be constructed from the information we stored for columns of a constant-size range 
of leaf counts preceding the current leaf count. 

In the following, we describe the two algorithm phases in more detail. The correct- 
ness proofs can be found in the full version of the paper. 

2.2 Preprocessing Phase 

Terminology. We start with some technical terminology that will be used in the descrip- 
tion of the preprocessing phase. 

An antichain in a partially ordered set (poset) P is a set of incomparable elements. 
The width of a poset is the size of its largest antichain. An antichain is maximal if none 
of its proper supersets is also an antichain. A maximal antichain is highest of cardinal- 
ity i if it has cardinality i and no element of any other maximal antichain of cardinality i 
strictly dominates any of its elements. We write hma,(P) for the set containing the 
highest maximal antichain of cardinality i, if it exists, or for the empty set, if not. Let 
hma(P) := UJli hma,(P). In the following, hma,(P) will typically be empty for i > 3, 
but it will sometimes be useful to allow for these sets to be nonempty in order to ‘catch 
errors’. 

We define a relation on the set {0, 1,2} by 1 2 0. It is extended to {0, 1,21- 

columns by setting c>c' if c[i] ^ c'\f\ for all rows i. This extended relation is a poset 
relation. The leaf count £(c) of a column c is twice the number of 1 -entries plus the 
number of 2-entries (?- and 0-entries are not counted). For a matrix A let A; denote the 
set of columns that have leaf count exactly i. Note that if c> c' then f(c) > t{c'). Thus 
the leaf count is a linear extension of the partial order ^ . 
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A (partial) resolution of a column c with ?-entries is a column c' obtained by re- 
placing (some of) the ?-entries with 0, 1, or 2. A resolution of a set C of columns with 
?-entries is a set containing one resolution of each c G C. Let res C denote the set of all 
resolutions of a set C of columns and let presC denote the set of all partial resolutions. 
Since every column has at most k question mark entries, every column has at most 3* 
resolutions. Thus |resC| < 

A consensus for a set C of columns is a column that is a partial resolution of all 
c G C. Note that two columns c and c' have a consensus iff c[i] f c'[i] implies c[i] = ? 
or c' [i] = ? for all rows i. A consensus can contain ?-entries only in rows in which all 
columns in C have a ?-entry. For an incomplete genotype matrix A and a column c (not 
necessarily in A), the dimension of c in A is the size of largest subset C of columns of A 
such that c is a consensus of C. 

Our final technical tool is the perfect path phylogeny property (ppp-property), which 
is defined as follows: Let C be a set of {0, l,2}-columns We say that C has the ppp- 
property if the following two conditions hold: First, the width of (C, is at most two. 
Second, there are two (possibly empty) chains (Ci , and (C 2 , covering (C, such 
that for each row the following holds: If some column in one of the two chains has a 
1 -entry in that row, all columns in the other chain must have a 0-entry in that row. The 
name of the ppp-property is justified by the following lemma, see [5]. 

Lemma 2.1. A {0,1, 2}-matrixA admits a perfect phylogeny that is a path iff the set of 
columns of A has the ppp-property. 

Algorithm. The objective of the preprocessing is to ensure that, during the actual al- 
gorithm, there will only be a fixed number of columns that have any given leaf count. 
Normally, if a genotype matrix without missing data and without duplicate columns has 
more than two columns of the same leaf count, the matrix does not permit a perfect 
phylogeny that is a path. The reason is that different columns with the same leaf count 
are always incomparable with respect to the partial order and thus cannot lie on the 
same path from the root to a leaf of a perfect phylogeny — and if the perfect phylogeny 
is a path, there are only two paths starting at the root. 

In the presence of missing data there can be an arbitrary number of different 
columns that have the same leaf count, but still the genotype matrix allows a perfect path 
phylogeny. The reason is that we might be able to ‘collapse’ all the different columns 
into one column by filling up the missing entries appropriately. 

The idea of the preprocessing step is to perform such collapsing, but we cannot 
just ‘collapse everything that can be collapsed’ as it may be necessary not to collapse 
columns in order to realize the matrix. We show that it is ‘safe’ to collapse columns in 
the input matrix if their number is large. From here on, we denote k(1) := 6‘ ■ i\. 

Lemma 2.2. Let Abe a genotype matrix with at most k question mark entries per col- 
umn. Let C be a set of columns in A and let £ > I be minimal such that the columns 
in C have a consensus c containing k — £ questions mark entries. If\C\> k(^), then the 
following holds: The matrix B obtained from A by replacing all columns in C by c is 
ppp-realizable iffB is realizable. 

To give a flavor of the proof, which is omitted due to lack of space, consider the 
case £= I and consider seven columns in A that have a consensus c. We are done if 
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there is a ppp-realization of A that contains the consensus c. So suppose that this is not 
the case and that the realizations of the seven columns of A differ from the consensus. 
Since l=\, for each of the seven columns there is exactly one row where the column 
differs from the consensus, namely by having a ?-entry in that disputed position. 

We extract the rows containing disputed positions. Since any two of the seven 
columns are different, every disputed position must be in a different row. Thus we ex- 
tract at least seven rows. For any seven rows, at least three rows must have a 0-entry, 
a 1 -entry, or a 2-entry in the consensus. We extract these three rows and sort them ac- 

X x\ 
xl x] 

X X 1 J 

with X G {0, 1,2}. This matrix, and hence the original one, is not ppp-realizable, unless 
one of the ?-entries is resolved by x. 

The rest of the lemma is proved is proved by induction on £. For the inductive step, 
we use graph-theoretic arguments on a graph defined on the columns of the genotype 
matrix A and combine them with a contradictory argument similar to the one just given. 

While Lemma 2.2 allows us to perform collapses whenever numerous columns have 
a common consensus, it is not clear how we can find these columns efficiently. Given a 
genotype matrix A in which every column has at most k question marks, we call a set C 
of columns from A a collapsible set if it meets the requirements of Lemma 2.2, i.e., if 
there exists a consensus c with exactly i question mark entries and if |C| > k(T). The 
following lemma states that a collapsible set of columns can be found efficiently. 

Lemma 2.3. Given an n x m genotype matrix A with at most k question mark entries 
per column, we can, in time 0{4^m^n) and space 0{mn), find a collapsible set C of 
columns with its corresponding consensus c or determine that no such set exists. 

In the preprocessing step we remove duplicate columns and then invoke the algo- 
rithm from Lemma 2.3 as often as possible. As long as the algorithm outputs some set C 
and a consensus c, we collapse the columns in C, remove duplicates, and repeat. The 
resulting preprocessing is summarized in Figure 2. 

The essential property of the preprocessed matrix is that Lemma 2.2 ‘does not ap- 
ply’ to it. Thus the dimension of any column c in it is at most K{k) = 6^ -kl. This allows 
us to sum up the effect of the preprocessing phase in the following theorem, which will 
be crucial for keeping the dynamic programming tables of our main algorithm ‘small’. 
From here on, we use X{i) := {4i + 2)k(j). 

Theorem 2.4. Let B result from A by the preprocessing algorithm in Figure 2. Then B 
is ppp-realizable iff A is. Furthermore, if more than fifk) columns in B have the same 
leaf count, then both A and B are not ppp-realizable. 



Running time. By Lemma 2.3, hnding a set of columns that can be collapsed (realized 
inside the repeat-loop in Figure 2) can be done in time 0{4^m^n). Since we can perform 
at most m collapsing steps before running out of columns, the preprocessing algorithm 
will stop after at most 0{4^m^n) steps. 
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Input: Genotype matrix A with at most k question mark entries per column. 
Output: Genotype matrix B in which every column has dimension at most K(k) and 
which is ppp-realizable iff A is ppp-realizable. 



B^A 

repeat 

for f ^ 1 to k do 

foreach column I; of B do 
foreach p g pres{I>} do 

if p has k — £ many ?-entries then 

C {c I c is a column of B and a resolution of p} 
if |C| > k(/) then 

B ^ B with all columns in C replaced by p 
break for loop 
until B remains unchanged 

return B 



Fig. 2. The preprocessing algorithm 



2.3 Dynamic Programming Phase 

Terminology. In the following, we introduce terminology for the description of the 
dynamic programming table. The input for the dynamic programming phase is a pre- 
processed nx m matrix B. Observe that the leaf counts of the columns of B range be- 
tween 0 and some number L < 2n. During the execution of our algorithm, for increasing 
numbers i G {0, ■■■ ,Lj we consider the set B, of columns of B that have leaf count i. 
By Theorem 2.4, if the size of Bi is larger than X{k), we can output that no realizable 
resolution exists. 

In the following, we dehne sets R , that, intuitively, contain all ppp-realizable can- 
didate resolutions for the columns whose leaf count ‘could become’ i. Consider the 
union of tho sets Bj^ 2 k, Bi^ 2 k+\, ■■■, Bi (we assume that Bj := 0 for nega- 
tive j). Thus, contains all columns whose leaf count, after resolution of the 

question marks, could become i. With Theorem 2.4, the cardinality of B[,_ 2 /t,i] is at 
most {2k+ l)X{k). We require that a set B G resB[;_ 2 jt,!] is in R , if (B, has the ppp- 
property: 

B , := {B I B G ie:sB^i_ 2 k,i],R is ppp}. 

We can bound the size of R , by bounding the size of resB[,_ 2 t:,!]- ®ach question mark 
in a column in B[,_ 2 t:,!] can be resolved in three different ways and there are at most 
k(2k+ 1) • X{k) question marks altogether in B[,_ 2 jt,!]- Thus, |B ,| < 

We next extend our notion of candidate resolutions which are so far defined for a 
set of columns having a limited range of leaf counts. We introduce sequences of these 
candidate resolutions covering all columns having leaf counts up to i. Let us call a 
sequence (Bq, . . . ,B,) G B o x • • • x B , consistent if every column of B is resolved in 
exactly the same way in all Rj in which it occurs. Let 
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Input: Genotype matrix A with at most k question mark entries per column. 
Output: Statement whether A admits a perfect path phytogeny. 

B ^ preprocess A 

for i ^ 0 to L do 

if \Bi\ > X{k) then output ‘A has no realizable resolution’; stop 
j ^ min{7 I By / 0} 
foreach Rj e R j do 

if Rj is ppp then H {Rj) ^ {hma(By)} 

for i ^ 7 + 1 to L do 
/ ©then 

j ^ max{; | y < i and is not empty} 

foreach B, e R i do 

H (Ri) = {hma{Hj U | By G B j, {Rj,Ri) is consistent, 

Hj e H (By), By U [B,]-' is ppp} 
if H {Ri) / 0 then output ‘A admits a perfect path phytogeny’ 

else output ‘A does not admit a perfect path phytogeny’ 



Fig. 3. Main algorithm. Using standard dynamic programming techniques, the above 
algorithm can be modified to output a realizable resolution of A instead of just deciding 
whether such a resolution exists. 



R <i := {(Bo, . . . ,Ri) € R qx ■ ■ ■ xR i \ (Bq, . . . ,B,) consistent, BqU • • • UB; is ppp}. 

Intuitively, each (Bo,...,B,) G R <, (which does not contain ?-entries) is a possible 
‘consistent’ realization of columns in Bo,...,B, (which may contain ?-entries). The 
columns in Bo U • • • U B,, however, may have leaf counts larger than i, namely up to 
i + 2k. Being interested in ‘filtering out’ those columns that actually have a certain leaf 
count, for a set B of columns we define the set [R]^-' to contain the those columns in 
B that have leaf count j. In the same way we define [B]-' and [B]^T Thus, we use 
subscripts to refer to the leaf count before replacing question marks, and superscripts to 
refer to the leaf count after the resolution. 

Our final technical definition concerns the highest maximal antichains of the column 
sets. For B, G B , we dehne: 



H {Ri) := {hma ([Bo U • • • UB,]^') |(Bo, . . . ,B,) G B <,} . 



Algorithm. The algorithm uses dynamic programming. The employed dynamic pro- 
gramming table has one column for every leaf count ranging from 0 to L. The column 
corresponding to leaf count i has one entry for every B, G R i. The table entry cor- 
responding to Ri G B , is defined to contain //(B,). The dynamic programming step 
computes H (B;) based only on Hj{Rj) for all By G B y, where j < i is maximal such 
that By_ 2 k.j] is not empty. Figure 3 shows pseudo-code for solving {0, 1,2, ?}-PPPH. 
Proving the correctness of this algorithm is deferred to the full version of the paper. 
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Running time. The algorithm iterates L<2n times. In each iteration it computes H (/?, ) 
for each Ri G R i. For computing H (Ri), we consider each R j and, for each of these, 
each Hj G H (Rj). We know that both \R , | and \R , | are hounded by 3<^(2^+i)^W (see 
definition of R The size of Hj can be bounded by . 2n. The test whether 

(Rj,Ri) is consistent and the test whether U /?, has the ppp-property can both be done 
in time In summary, the running time is 0{A^m^n + 20 (P- 6 '‘-k\) including 

the preprocessing. 

3 Conclusion and Open Problems 

We have shown that the haplotype inference problem via perfect phytogeny in the pres- 
ence of missing data becomes feasible if the input data meets two conditions: the num- 
ber of missing entries per SNP site is small and the perfect phytogeny is a path. An 
analysis of publicly available data shows that these requirements are often met by ge- 
nomic data available for humans. 

While, admittedly, the factor in the running time of our algorithm that depends on 
k grows quickly with k, the estimate that we give is a worst-case estimate that might be 
far too pessimistic in real instances. 

Two main open questions remain concerning the presented problem parameteriza- 
tion, namely by the maximum number of missing SNP states per SNP site. First, is the 
undirected version of the perfect path phytogeny problem hxed-parameter tractable? 
Second, and more importantly, is the perfect (possibly non-path) phytogeny case fixed- 
parameter tractable? 
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Abstract. We give simple, self-contained proofs of the basic hardness 
results for the classes W[t\ of the weft hierarchy. We extend these proofs 
to higher levels of the hierarchy and illuminate the distinctions among 
its classes. The anti-monotone collapse at W[l, s] and the normalization 
of weft-t formulas arise as by-products of the proofs. 



Introduction 

The theory of fixed-parameter hardness and the IF-hierarchy has been obscured 
by technicalities of the definitions of the classes of the VF-hierarchy and by the 
difficulty of the known proofs of the basic completeness results. We present a sig- 
nificantly simpler formulation and proofs of basic completeness results for ], 
which eliminate many of the technicalities. The new proofs closely follow the de- 
velopment of the theory of AP-completeness, greatly simplifying the complex 
manipulations of circuits and eliminating the ancillary graph-theoretic problems 
of the original proofs. 

Starting from Chen, Flum and Grohe’s [4] characterization of W\t], we 
present self-contained and relatively straightforward proofs that 

1. Weighted c-SAT g W[1] for all constants c > 2, 

2. Short NTM Acceptance is hard for W[l], and 

3. Short NTM Acceptance is reducible to Weighted 2-SAT via a reduc- 
tion that produces only anti-monotone formulas. 

Therefore, an initial treatment of ]-completeness can follow entirely classical 
lines, closely mimicking the standard proofs that SAT is AP-complete: 

1. SAT G NP, 

2. A Turing machine can simulate a log-cost random-access machine in poly- 
nomial time, and 

3. CNFSAT is AP-hard. 

Along the way, we obtain the restriction to anti-monotone 2-SAT formulas as a 
simple observation. 

The full impact of the new formulation arises higher in the IF-hierarchy. We 
present new proofs that 

* Both authors supported in part by a grant from the Natural Sciences and Engineering 
Research Council (NSERC) of Canada. 
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1. Weighted WEFT-t-DEPTH-d SAT G W[i\ for all constants d>t>2, and 

2. Weighted ^normalized SAT is hard for W[i\. 

Our new proofs, unlike the former ones, are straightforward extensions of the 
W[l] case. Moreover, the full Normalization Theorem and the Monotone and 
Antimonotone Collapse Theorems arise as simple corollaries of the proofs. 

Finally, we abstract the Chen-Flum-Grohe characterization to a more general 
setting, comparable to but different than the “guess-then-check” classes of Cai 
and Chen [1] . The new viewpoint illuminates the apparent disconnection between 
alternation and space in parameterized complexity [4] . 



1 Background 

1.1 Parameterized Problems and the Weft of Circuits 

We briefly review the definitions of parameterized problems and the weft of a 
circuit; for further history and discussion, see Downey and Fellows [6]. 

Definition 1. A parameterized problem is a set of pairs Q C E* x F* over 
finite alphabets E and F. For each (x, k) G Q, x is taken as input and k is taken 
as the parameter. A parameterized problem is fixed-parameter tractable if there 
is an algorithm that can determine whether {x,k) G Q using time f{k)p{\x\) for 
some computable function f and polynomial p. 



Definition 2. An FPT-reduction from a parameterized problem Q C E* x F* 
to a parameterized problem Q' C (£■')* x (T')* is a mapping R: E* x F* 
{E')* X {F')* such that 

1. For all (x, k) G E* x F* , {x, k) G Q ^ R{x, k) G Q' , 

2. There exists a computable function g:IN i-^- IN such that for all (x,k) G 
E* X F* , where R{x,k) = {x',k'), we have k' < g{k), and 

3. There exist a computable function /:IN i-^- IN and a polynomial p such that 
R is computable in time f{k) •p{\x\). 

We write Q Q' if there is an FPT reduction from a parameterized problem Q 

to another parameterized problem Q' . 

The breakthrough in measuring parameterized hardness depended on a mea- 
sure of circuit (or formula) complexity called weft. Fix any constant u > 2; call 
a gate of a circuit “large” if it has fan-in more than u. 

Definition 3. Let C be a decision circuit. The weft of C is the maximum num- 
ber of large gates on any path from the inputs to the output. A circuit is t- 
normalized iff the and- and or-gates alternate, with an and-gate at the output, 
and it has depth (and hence weft) at most t. 
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Since we will deal only with circuits of bounded depth and polynomial size, 
circuits are equivalent to formulas (tree circuits) of the same depth and poly- 
nomial size. A formula in u-CNF has weft 1 and depth 2; any CNF formula is 
2-normalized. 

A central problem in the theory of parameterized complexity is the weighted 
satisfiability problem. An assignment to the variables of a circuit has weight k 
iff it sets esactly k variables to true. These following variations of weighted 
satisfiability formed the basis of original formulation of the VF-hierarchy. 

Weighted SAT 

Instance: A Boolean formula ip. 

Parameter: An integer k. 

Question: Does ip have a weight-A: satisfying assignment? 

Weighted c-SAT (for c > 2) 

Weighted SAT restricted to c-CNF formulas. 

Weighted WEFT-t-DEPTH-d SAT (for d>t>2) 

Weighted SAT restricted to formulas of weft t and depth d. 

Weighted ^normalized SAT (for t >2) 

Weighted SAT restricted to t-normalized formulas. 



Firming the theory required establishing two key “collapses” in complexity. 

Proposition 4. 

1. Weighted c-SAT Weighted 2-SAT for all constants c > 2. 

2. Weighted WEFT-t-DEPTH-d SAT Weighted ^normalized SAT, 
for d>t>2. 

Each of these intermediate results was obtained via a complicated combinatorial 
proof including an ancillary graph-theoretic problem. For the first case, the proof 
defined and used the novel Red/Blue NonBlogker problem; for the second 
case, the classical DominatingSet problem sufficed. The details of reductions 
from satisfiability to the graph problems were crucial to the proofs. Although 
the two properties seem related, their proofs were nevertheless quite distinct. 
Below, we obtain both results as direct corollaries of our proofs. Also, our proof 
for weft t > 2 is simply an extension of the proof for weft 1. 

1.2 Machines and Algorithms 

Recently,Chen, Flum, and Grohe [3,41 discovered a machine-based characteriza- 
tion of W[t]. 

Definition 5. A W-RAM is an alternating random-access machine with two 
sets of registers: standard registers ro, ri, r 2 , . . . , and guess registers go, 5i, 52 , ■ • ■ • 
Any standard deterministic operation is allowed on the standard registers. Only 
the following operations may access the guess registers: 
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Exists j: Store any natural number in the range [0,ro] in register gj, with exis- 
tential branching. 

Forall j: Store any natural number in the range [0,ro] in register gj, with uni- 
versal branching. 

JG= i j c: If gn = gr^ , then jump to instruction location c. 

JGO i j c: If ^g^ ,) = 0, then jump to instruction location c. 

Here (•,•) is any suitable pairing function. 

Limiting the access to guess registers simplifies the analysis of a computation. 



Lemma 6. If two executions of a W-RAM on the same input follow the same 
sequence of instructions, they have the same set of values in the standard regis- 
ters. 

Definition 7. A W-RAM program R is an AW-program if there are a com- 
putable function f and a polynomial p such that for every input {x, k) with \x\= n 
the program R on every run, 

1. performs at most f{k) • p{n) steps, 

2. has at most f{k) existential or universal steps, 

3. occesses at most the first f{k) ■ p{n) standard registers, and 
j. stores values at most f{k) • p{n) in any register at any time. 

We shall treat the following result of Chen, Flum and Grohe [4] as the defi- 
nition of the classes W[i\. 

Proposition 8. Let Q be a parameterized problem and t > 1. Then Q is in 
W[t] if and only if there are a positive integer u, a computable function h and 
an AW-program R for a W-RAM such that R decides Q and such that for every 
run of R on an instance (x, k) of Q as input, 

1. all existential and universal steps are among the last h{k) steps of the com- 
putation, 

2. there are at most t — 1 alternations between existential and universal states, 
the first guess step is existential, and 

3. every block without alternations, besides the first one, contains at most u 
guess steps. ^ 

Simple algorithms suffice to put some problems in the FF-hierarchy. 

Lemma 9. 

- The problems IndependentSet, Clique, Short NTM Acceptance and 
Red/Blue NonBlocker are in W[l]. 

- DominatingSet is in W[2]. 



^ Omitting this condition gives apparently different classes denoted A[t] [7]. With 
hindsight, one may consider that the difficulty of formulating this condition in terms 
of Turing machines or circuits lies at the heart of the difficulty in creating the original 
formulation of parameterized hardness. 
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Proof. For example, to determine whether a graph G has an independent set of 
size k, first construct the adjacency matrix Aq of G, using time O(n^). Then, in 
0{k^) further operations, guess k vertices and check that all of the corresponding 
entries in Aq are 0. 

The other algorithms are similar. □ 

2 The Complexity of Weighted Satisfiability 

2.1 AW-programs 

Algorithms placing the variants of weighted satisfiability in the IT-hierarchy are 
somewhat more complex than those given earlier, but follow the same basic plan. 



Lemma 10. Weighted 2-SAT is in W[l\. 

Proof. The algorithm considers three categories of clauses. For each category, it 
creates a separate look-up table for the properties required by clauses of that 
category. It then guesses a single weight-A: assignment and checks that the as- 
signment satisfies the conditions of each table. 

Anti- monotone clauses: {^Xi V ~^Xj). 

These clauses are equivalent to an instance of IndependentSet, with one 
node for each variable and an edge between two variables appearing in the 
same clause. The look-up table is simply the adjacency matrix. 

Implications: (^XiVxj). 

We interpret these clauses as Xi xj. For each variable Xi, the algorithm 
computes and stores the set of variables S'i = { Xj \ (^Xi V xj) appears in }. 
Once an assignment A is guessed, for each variable Xi set to true by A, it 
looks up the corresponding Si, and checks that each variable in Si is also 
true. These checks require O(k^) steps for a weight-A: assignment. 
Monotone clauses: {xi V Xj). 

If some variable Xi appears in more than k distinct monotone clauses, it 
must be set true in every weight-A; satisfying assignment. The other clauses 
contain only variables that occur at most k times; if they are simultaneously 
satisfiable with weight k, there must be at most A;^ such clauses. 

To check satisfaction by an assignment, the algorithm checks that each re- 
quired variable is true and that the remaining true variables satisfy the k^ 
other clauses. 

After creating the data structure for each category of clauses, R nondeter- 
ministically guesses k variables to be true and accepts iff the guessed set of 
variables satisfies all three constraint structures. This check uses 0{k^) steps, as 
required. □ 

The above monotone and anti-monotone clauses may also be considered as 
implications, of the respective forms {true (xi V Xj)) and ((xi A xj) false). 
We generalize the idea to an algorithm solving Weighted c-SAT, for any con- 
stant c, and then further to higher weft. 
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Lemma 11. For all c>2, Weighted c-SAT g W[1]. 

Proof. For each 0 < r < c, consider the clauses with r negated variables: 
(El=i ^j) or equivalently, ((OLi x^) Ej=r+i Regroup all 

clauses with the same implicant together, producing an equivalent formula 

C / ^ 

n n f ^ 

r=0 \ i=l 

where the second product is over all size-r subsets of the variables, and each 
is a monotone (c — r)-CNF formula. 

For each value of r < c, the algorithm constructs a table of size (") , with one 
entry for each subset of size r. The entry corresponding to contains 

all information needed to check whether a guessed assignment satisfies ipx[...x'^- 
Since this formula is monotone, the data size and checking size are bounded by 
a functon of k and c, regardless of n. □ 

Lemma 12. Weighted WEFT-t-DEPTH-d SAT G W[t], for d>t>2. 

Proof. Fix any constant d, and let (phe & weft-t, depth-d tree circuit with nega- 
tions only at the inputs. The algorithm has three main phases: simplifying the 
form of the circuit, constructing tables to enable fast truth-checking, and finally 
evaluating the circuit at guessed values using alternation. 

1. Simplify the form of the circuit, to have (starting from the output) at most 
two levels of small gates, followed by t levels of alternating large gates (no 
small gates), and then one level of small gates. The algorithm proceeds from 
the output of the circuit to the inputs, locally restructuring the circuit so as 
to combine each small gate with a neighbouring large gate.^ 

2. Construct satisfaction-checking tables (as in the algorithm for CNFSAT 
above) for the large gates closest to the input literals. 

3. Guess k variables to be true, find inputs to t — 1 levels of large gates using 
alternation, and finally check the satisfaction of the input gates using the 
tables constructed in phase 2. 

Simplifying the Form of a Circuit. Let a “small sub-circuit” be a maximal 
connected sub-circuit comprising only small gates. Since each small sub-circuit 
has gates of fan-in two and has depth at most d, it receives at most 2'^ “input” 
values (each either a literal or the output of a large gate). Therefore, its CNF 
and DNF equivalents each have size at most 2^ , which is a constant for our 

^ This process is essentially the one used as a (small) part of the original proof of 
the Normalization Theorem ([6], p. 293). The tables of phase 2, combined with the 
alternation of phase 3, replace the multiple changes of variables and bypass the 
DominatingSet problem entirely. Our following proof of the lT[t]-completeness of 
Weighted WEFT-t SAT will complete the normalization. 
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purposes. We will select one of these equivalents depending on the surrounding 
circuit. 

As a first step, convert the small sub-circuit (if any) at the output either to 
CNF or to DNF. Next consider a large gate G whose output is an input to this 
sub-circuit. We shall assume for simplicity of description that G is an and-gate. 
(If G is an or-gate, apply the De Morgan dual of the following.) Each input 
Gi of G comes from another small sub-circuit Si. Convert each St to its CNF 
equivalent S[, and merge the output gates of each S[ into G, forming a larger 
and-gate G' . 

Each or-gate Sij of each S[ has at most 2'^ gates as inputs. Those that are 
or-gates may logically be merged with Sij, but to do so would increase the weft 
if Sij also has a large and-gate as input. To circumvent this problem, consider 
the partial circuit comprising gate and all of its large and-gate inputs. (The 
inputs to this partial circuit come from or-gates, small sub-circuits, and literals.) 
The partial circuit is in DNF form, with a constant number (2*^) of disjuncts. 
Therefore, its CNF equivalent circuit Gij has size Replace the partial 

circuit by Gij and finally merge the output of Gy with the original gate G. 

At this point the gate G has only large gates and literals as inputs. Move 
to the next level of large gates, and modify them in the same manner. Upon 
reaching the input variables, the new circuit has the required form. 



Creating the Tables for Satisfaction-Checking. Following the simplifica- 
tion, each depth-2 circuit at the inputs, comprising a single large gate whose 
inputs are small gates with literals as inputs, is in either 2'^-CNF (for a large 
and-gate) or 2‘^-CNF (for a large or-gate) form. Each of these depth-2 circuits 
receives its own tables for checking satisfaction. In the CNF case, the tables are 
those from the previous algorithm. The DNF case requires a further technique. 

For the DNF case, in addition to using variables as indices into the tables, 
we introduce a set of additional indices, Y = {yij \ 1 < t < j < A:}, where j/y 
signifies that all variables Xi, ior i < £ < j, are set false. ^ Let Z be any consistent 
set of at most 2‘^ indices from X UY. The entry correponding to Z in the table 
for a large or-gate G is set to 1 if any single conjunct is made true by Z (which 
guarantees that any assignment consistent with Z satisfies G). 



Evaluating the Circuit. To check satisfiability of the resulting weft-t formula 
by a weight-fc assignment, existentially guess both the assignment and the val- 
ues of the small monotone sub-circuit at the output. Proceed toward the inputs, 
guessing an input line for each large gate (existentially for an or-gate and uni- 
versally for an and-gate), until an input depth-2 circuit is reached. Finally, check 
that the required conditions hold: each large gate computes its output correctly, 
the output sub-circuit produces true, and the truth-checking tables justify the 
guessed values. In the case of an or-gate, it is necessary to check all consistent 

® These indices correspond to the “gap variables” of the original proof. They are the 
only remnant of the DominatingSet problem. 
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combinations of indices; since there are ^ such combinations, the required 
bounds are met. □ 

2.2 Hardness for 

For the hardness results, we use the Short NTM Acceptance problem. Two 
lemmas provide the connections to satisfiability and to AW-programs. 

Lemma 13 (Chen, Flum). Short NTM Acceptance is hard for 1F[J]. 

Lemma 14 (Cai, Chen, Downey, Fellows [2]). 

Short NTM Acceptance Weighted 2-amSAT. 

Together with the above algorithm for Weighted c-SAT, these yield a new 
proof of the “Cook’s Theorem” for 1F[J]. 

Theorem 15. The problems Short NTM Acceptance, Weighted c-SAT 
(for any fixed c>2) and Weighted 2-amSAT are complete for W[l]. 

Well-known reductions then give 

Corollary 16. IndependentSet, Clique, and Red/Blue NonBlocker are 
W\l]- complete. 

We reprise the original proofs of the lemmas (cf. [6]) in order to later extend 
them higher in the weft hierarchy. 

Proof, (of Lemma 13.) Let Q G W\l] be a paramaterized problem solved by 
AW-program R, using f{k)p{\x\) steps and ending with h{k) guess steps, on any 
input {x, k). (Hereinafter we shall omit the arguments of /, p, and h.) The desired 
reduction, on input (x,k), will produce a nondeterministic Turing machine M 
that accepts the empty input in k steps if and only if {x, k) G Q. M will have 
alphabet S = [f ■ p]; that is, one symbol for each possible value of a register 
of R. The states of M will encode {k + l)-tuples {S,ii . . . ,ik), where S is an 
instruction of R and i\,. . . ,ik are indices of registers. 

The start state of M encodes the configuration of R{x) reached just before 
the first guess step. M begins by guessing the h values for the existential registers 
of R, writing one guess to each of h cells. It then checks, in hf steps, that R 
would indeed accept with these guesses. Thus M accepts the empty input in 
hf{k) steps iff R accepts (x,k), as required. □ 

Proof, (of Lemma 14.) Let M have state-set Q and alphabet S (including blank). 
Without loss of generality we assume that M has a unique halting state (/yes, 
that M reaches this state only at the left end of the tape, and that once M 
enters state Pyes it remains in the same state and cell indefinitely. 

Consider the tableau associated with a k-step computation of M; it contains 
k rows and k columns, for a total of k^ entries. The contents of the tableau are 
described using variables Ytphra, where I < t < k, I < p < k, h < k, t G 
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Q, and a G S. The reduction produces a formula tp that is satisfied by a 
weight-A:^ assignment A of the variables if and only if there is a computation 
of M such that for all t, m, h, q and cr, variable Ytmhqa has value true in A 
exactly if at time t, position p of the tape contains symbol cr and the head scans 
position h in state q. 

To obtain anti-monotone 2-CNF formulas, we do not directly require consis- 
tency among the parts of the computation; instead, we forbid inconsistency. All 
clauses will have the form V ~^x. For example, the condition that at most one 
state occurs at time t is expressed by the conjunction 

n n n n {,~AYtphqc7 V 'Yip' h' q' 

(T,a'^S <]7^Q'^Q 

Similar conjunctions enforce the other constraints. We leave them to the reader. 

The condition that some combination of state, head position and cell contents 
must hold at each step is not directly expressed by the formula. This condition 
is enforced by the consideration only of assignments of weight in combination 
with the expressed prohibitions. 

The entire formula has size f'{k)p'{n) and can be computed in proportional 
time. It has a satisfying assignment of weight h{k)^ if and only if R accepts input 
(x, A:), as required. □ 

Looking ahead, we note that one can substitute monotone CNF formulas 
(with unbounded conjuncts) for the anti-monotone 2-CNF formulas of the proof. 
For example, the condition that at least one state occurs at time t is expressed 
by the formula 

n n n X! ^tphqa 

l<p<fc q^Q 

With all subformulas modified in this fashion, the restriction to assignments of 
weight h{k)'^ ensures that multiple states do not occur. 



2.3 Hardness for W[t], t > 2 

To translate the above results to W\t] for A > 2, we combine the proofs of 
Lemmas 13 and 14. The key additional technique is to handle all of the guess 
steps after the first existential block deterministically, as part of the reduction 
itself. In other words, for every configuration that could occur at the end of the 
first existential block, the reduction computes the conditions under which the 
machine will eventually accept from that configuration and incorporates them 
into the reduction formula. 

Theorem 17. Fort > 2, Weighted A-normalized SAT is complete for W[i\. 



Corollary 18 (Downey, Fellows [5]). DominatingSet is W[2]-complete. 
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Proof. Let AW-program R accept Q in time f{k)p{n), with guess steps only the 
last h{k) steps and occurring in t alternating blocks, with an existential block 
first and all blocks after the first limited to u guesses. We consider a computation 
of R to have three parts: (1) the deterministic part, up until the first guess step, 
(2) the existential part, comprising the steps from the first guess step up to the 
first universal step, and (3) the decision part, comprising the remainder of the 
computation. We handle the first two parts by combining the constructions of 
Lemmas 13 and 14, producing a set tpE oi clauses that mimic the computation 
of R. However, we omit the clauses that require the (virtual) Turing machine to 
end in an accepting state. We will replace the omitted clauses with a formula 
(fiA that requires part 3 of i?’s computation to be accepting, as given below. As 
noted above, tpE can be taken in either monotone or anti-monotone form, as 
required. 

For simplicity, we start with the case t = 2; that is, part 3 comprises a single 
universal block with u guesses. For each possible outcome G = {gi,g 2 , ■ ■ ■ ,5«) 
of the universal guesses and each possible branch b determined by outcome of 
the tests involving the earlier existential guesses, the reduction will produce a 
formula ac.fc described below. With any assignment to (pE that represents a valid 
computation of the existential part, the formula ac,b will evaluate to true if the 
universal choices specified by G lead to acceptance along path b or if some path 
other than b is followed, and will evaluate to false if the universal choices given 
by G lead to rejection along path b. Thus the conjunction of all formulas ac,b 
gives the required acceptance condition for the computation of R. Since there 
are (fp)^ possible outcomes of the u universal guesses and 2^ possible branches 
in h steps, the overall formula meets the required bounds on size. 

If branch b ends by accepting, using aob = true suffices. The case that b 
rejects requires a non-trivial formula, based on sub-formulas for each test in- 
struction T on branch b that involves an existential register. Each subformula 
will evaluate to true if test T is satisfied. For a given test instruction T, we 
shall actually define four different formulas. The formulas ttt and ut will be 
satisfied if T is taken, and tt-.t and a-,r will be satisfied if T fails. The tt for- 
mulas will be anti-monotone products-of-sums and the a formulas monotone 
sums-of-products . 

Let Yji (in full, YkjHqyes ) be the variable that is true iff register j contains 
value i at the end of part 2 of the computation by R. Note that when ipE is 
satisfied by an assignment of the correct weight, exactly one Yji is true for each 
value of j . Thus we define the subformulas ut and ttt to be correct given this 
condition, as follows. 



T is JG= i j c : 

If i is universal and j existential, let 

= Yjg, 



= ~^Yii 



and ttt 
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If both i and j are existential, let 

ar = ^ hit A Yjg and ttt = ~^Yu V • 

o<e<fp o<e^i’<fp 

T is JGO i j c : 

If i is universal and j existential, let 

CTT = X! 

o<e<fp 

and 

0<i<fp l<m</p 

If both i and j are existential, let 
or = ^2 

and 

7Tt = -^Yii V -^Yji: V . 

0<t/t'</p l<m</p 

For any test T, the formulas ct^t and are obtained as the normal-form 
equivalents of ^ttt and of ^ctt, respectively. 

The formula ac,b is then ^ ct^t- Overall, the monotone CNF formula 

= nri'^'^A 

G b 

evaluates to true on a weight-h^ assignment corresponding to a part-2 compu- 
tation by R if and only if R accepts after the corresponding existential guesses. 
The size of tpA is g{k)e(n) for a computable function g and a polynomial e (which 
depend on the constants t and u, as well as functions h and /). 

At this point, we have obtained a weft of 2, but the depth is 3 due to the 
small conjunctions in the formulas To convert to depth 2, we introduce two 
new sets of variables Zed and Zede for 0 < c,d,e < fp. Let tpz be the conjunction 
of the CNF equivalents of each bi-implication 

and 

•^(*i,ii>(i2,i2><*3,i3> (hilii A Yi^j^ A Yi^j^) . 

In formula (pA, replace each compound disjunct of each acb by the equivalent Z 
variable. 

The final formula has depth 2, and its size remains polynomial in 

n and k. Further, program R accepts x iff the formula has a satisfying assignment 
of weight h"^ + (^2 ) + ( 3 ) ■ 
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For higher values of t, the same idea applies, except that the a formulas are a 
depth-(t — 1) function of the cr or tt formulas, giving an overall depth (and weft) 
of t. The formulas can be taken as monotone (using the tt subformulas) for even 
values of t, and anti-monotone (using the a subformulas) for odd values of t. □ 

3 Remarks 

Considering only the characterization of W[i\ by AW-programs, one may rea- 
sonably ask whether these classes really represent the “right” definition of hard- 
ness classes. The limited use of the guess registers and the restriction to a con- 
stant number of guesses in each block after the first seem rather arbitrary. To 
date, however, the classes A[i\ and L[t] obtained by not imposing these restric- 
tions [3,7] do not seem to have natural compete problems of independent inter- 
est. In contrast, W[2] (for example) has importance as the class of languages 
FPT-reducible to DominatingSet. 

Cai and J. Chen [1] proposed a model of “guess-then-check” computations: 
class GC{s{n),C) contains the problems that can be solved by existentially guess- 
ing a witness of s(n) bits and then checking the witness within complexity class C. 
The AW-programs considered here use a similar but more general paradigm that 
one might call “prepare, guess, and check.” They have a preparatory determin- 
istic part, followed by existential guesses and a check of their correctness. For 
example, W[l] corresponds to parameterized polynomial time followed by a 
parametric constant number of guesses (of 0(log n) bits each) with a parametric 
constant number of checking operations (with operands of O(logn) bits). Thus 
“PG'C'(FPT, NTIME{0{\og n)))” (with a suitable precise definition) is a suitable 
characterization of the class. The classes W\t] arise from replacing the checking 
class by a version of bounded-alternation LOGTIME . 

One can also allow other classes as checking classes. Some classes give ana- 
logues of classes above W[i\, for example PGG{EPT , ATIME{0{\ogn))) or 
PGG{FPT , S PAG E {O {log n))). The former class is the same as AIF]*] [4]. The 
latter class contains the former, since ATIME{0{log n) C SPAGE{0{log n)), but 
with SPAGE{0{\ogn)) C ATIME{0{log^ n)) as our best result in the reverse 
direction, one might suspect the two to be unequal. This explains the difficulty of 
creating “Pb'PAC'A-analogue” classes as opposed to “AP-analogue” classes [4]. 
We do not know the relationship of PGG{EPT, S PAGE {O {log n))) to AFF[P] or 
to other studied classes. 
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Abstract. We propose to look at the computational complexity of 2-dimensional 
geometric optimization problems on a finite point set with respect to the num- 
ber of inner points (that is, points in the interior of the convex hull). As a case 
study, we consider the minimum weight triangulation problem. Finding a mini- 
mum weight triangulation for a set of n points in the plane is not known to be 
NP-hard nor solvable in polynomial time, but when the points are in convex po- 
sition, the problem can be solved in O(n^) time by dynamic programming. We 
extend the dynamic programming approach to the general problem and describe 
an exact algorithm which runs in 0(6’‘n^ logn) time where n is the total number 
of input points and k is the number of inner points. If k is taken as a parameter, 
this is a fixed-parameter algorithm. It also shows that the problem can be solved 
in polynomial time if k = O(logn). In fact, the algorithm works not only for 
convex polygons, but also for simple polygons with k interior points. 



1 Introduction 

A lot of NP-hard optimization problems on graphs can be solved in polynomial time 
when the input is restricted to partial k-trees, that is, graphs with treewidth at most 
k, where k is fixed. In this sense, the treewidth is regarded as a natural parameter to 
measure the complexity of graphs. This is based on the observation that “some NP-hard 
optimization problems on graphs are easy when the class is restricted to trees.” 

We try to address the following question: What is a natural parameter that could 
play a similar role for geometric problems as the treewidth does for graph problems? 
One basic observation is that “some NP-hard optimization problems on a point set in 
the Euclidean plane are easy when the points are in convex position.” Therefore, the 
number of inner points can be regarded as a natural parameter for the complexity of 
geometric problems. Here, an inner point is a point in the interior of the convex hull of 
the given point set. 

In this paper, we concentrate on one specific problem: the minimum weight trian- 
gulation problem. The minimum weight triangulation is a triangulation with minimum 
total length of edges. Triangulations have numerous applications in finite element meth- 
ods, interpolation and graphics, to name just a few. In applications one is usually inter- 
ested in finding a triangulation that is optimal in a certain sense. Among several criteria, 
a minimum weight triangulation is one of the most natural ones. 
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putation” (CGC), financed by ETH Zurich and the German Science Foundation (DEG). 
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The minimum weight triangulation problem is notorious as one of the problems 
which are not known to be NP-hard nor solvable in polynomial time for a long time [8]. 
However, when the points are in convex position, the problem can be solved in poly- 
nomial time by dynamic programming. The main result in this paper is an exact al- 
gorithm to compute the minimum weight triangulation in 0(6'^n^ logn) time, where 
rt is the total number of input points and k is the number of inner points. From the 
viewpoint of parameterized complexity [7,14] this is a fixed-parameter algorithm if k 
is taken as a parameter.^ Furthermore, the algorithm implies that the problem can be 
solved in polynomial time if k = O(logn). 

Actually, our algorithm also works for simple polygons with inner points. Or, rather 
we should say that the algorithm is designed for such objects, and as a special case, we 
can compute a minimum weight triangulation of a point set. This digression to simple 
polygons is essential because our strategy is based on recursion and in the recursion we 
cannot avoid dealing with simple polygons. 

Related work Since the literature on the minimum weight triangulation problem is 
vast, we just mention some articles that are closely related to ours. As already men- 
tioned, finding a minimum weight triangulation of a finite point set is not known to 
be NP-hard nor solvable in polynomial time [8]. For an n- vertex convex polygon, the 
problem can be solved in 0(n^] using dynamic programming. For an n- vertex simple 
polygon, Gilbert [9] and Klincsek [12] independently gave a dynamic-programming al- 
gorithm running in 0(n^ ) time. But with inner points the problem seems more difficult. 
Another polynomial-time solvable case was discussed by Anagnostou and Corneil [3]: 
they considered the case where a given point set lies on a constant number of nested 
convex hulls. As for exact algorithms for the general case, Kyoda, Imai, Takeuchi & 
Tajima [11] took an integer programming approach and devised a branch-and-cut algo- 
rithm. Aichholzer [1] introduced the concept of a “path of a triangulation,” which can be 
used to solve any kinds of “decomposable” problems (in particular the minimum weight 
triangulation problem) by recursion. These algorithms were not analyzed in terms of 
worst-case time complexity. As for approximation of minimum weight triangulations, 
Levcopoulos & Krznaric [13] gave a constant-factor polynomial-time approximation 
algorithm, but with a huge constant. As for the parameterization with respect to the 
number of inner points, the two-dimensional Euclidean traveling salesman problem was 
recently shown to be fixed-parameter tractable [6] . 



2 Preliminaries and Description of the Result 

We start our discussion with introduction of some notations and definitions used in the 
paper. Then we state our result in a precise manner. From now on, we assume that input 
points are in general position, that is, no three points are on a single line and no two 
points have the same x-coordinate. 

* K fixed-parameter algorithm has running time 0(f (k)poly(n)), where n is the input size, k is 
a parameter and f ; N — > N is an arbitrary function. For example, an algorithm with running 
time O(440'^tl) is a fixed-parameter algorithm whereas one with 0(n'^) is not. 
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The line segment connecting two points p, q S is denoted by pq. The length 
of a line segment p^ is denoted by length (p^), which is measured by the Euclidean 
distance. A polygonal chain is a planar shape described as y = Ui=o PtPt+i where 
Po, • • . ,Pt € IR are distinct points except that po and pt can be identical (in such a 
case, the chain is closed). For a closed polygonal chain we assume in the following that 
all indices are taken modulo f. 

The length of y is the sum of the lengths of the line segments, that is, length (y] = 
length(piPi+i). We say y is selfintersecting if there exists i, j G {0, . . . , f— 1}, 
i / j, such that (piPi+i npjpj + i) \ {pi,pi+i ,Pj,Pj + i} 0. Otherwise, we say y is 
non-selfintersecting. The points po , . . . , P£ are the vertices of y. When y is not closed, 
Po and p{ are called the endpoints of y. In this case, we say y starts from po (or p{). 

A simple polygon P is a simply connected compact region in the plane bounded by 
a closed non-selfintersecting polygonal chain. A vertex of P is a vertex of the polygonal 
chain which is the boundary of P. We denote the set of vertices of P by Vert(P). A 
neighbor of a vertex p e Vert(P) is a vertex q G Vert(P) such that the line segment p^ 
lies on the boundary of P. 

Following Aichholzer, Rote, Speckmann & Streinu [2], we call a pair TT = (S, P] a 
pointgon when P is a simple polygon and S is a finite point set in the interior of P. We 
call S the set of inner points of IT. The vertex set of TT is Vert(P) U S, and denoted by 
Vert(TT). Fig. 1 shows an example of pointgons. 




Fig. 1. A pointgon TT = (S, P). In this paper, the points of S are drawn by empty circles 
and the points of Vert(P) are drawn by solid circles. 

Let TT = (S,P) be a pointgon. A triangulation T of a pointgon TT = (S,P) is 
a subdivision of P into triangles whose edges are straight line segments connecting 
two points from Vert(n) and which have no point from Vert(TT) in the interiors. The 
weight of T is the sum of the edge lengths used in T. (Especially, all segments on 
the boundary of P are used in any triangulation and counted in the weight.) A minimum 
weight triangulation of a pointgon TT is a triangulation of TT which has minimum weight 
among all triangulations. 

In this paper, we study the problem of computing a minimum weight triangulation 
of a given pointgon TT = (S, P). The input size is proportional to |Vert(TT)|. For a simple 
polygon, it is known that a minimum weight triangulation can be found in polynomial 
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time [9,12]. However, in spite of the simplicity of the problem, the minimum weight 
triangulation problem for general pointgons is not known to be solvable in polynomial 
time nor to be NP-hard [8]. Our goal is to find an exact algorithm for a pointgon H = 
(S, P) where |S| is small. The main theorem of this work is as follows. 

Theorem 1. Let H = (S, P) be a pointgon. Let n := |Vert(n]| and k := |S|. Then we 
can find a minimum weight triangulation o/H in 0(6'^n^ logri) time. In particular, if 
k = 0(logn] then a minimum weight triangulation can be found in polynomial time. 

This theorem shows that, in the terminology of parameterized complexity, the prob- 
lem is fixed-parameter tractable, when the size of S is taken as a parameter. 

In the next section, we prove this theorem by providing an algorithm. 

3 A Fixed-Parameter Algorithm for Minimum Weight 
Triangulations 

First, we describe a basic strategy for our algorithm. The details are then discussed in 
Sections 3.2 and 3.3. 

3.1 Basic Strategy 

In the sequel, for a given pointgon H = (S, P), we set n |Vert(H)| and k |S|. 

An inner/7flf/j of a pointgon n = (S,P) is a polygonal chain y = Ui=o PiPt-M such 
that po, . . . ,Pf are all different, pi S S for each i € ,f— 1}, Po,Pt S Vert(P), 

and y \ {po , Pt} C P. An inner path IJ^^o pTpT+T is called x-monotone if the x-coordi- 
nates of po , . . . , P{ are either increasing or decreasing. 

The basic fact we are going to use is the following. 

Observation 2. Let H = (S, P] be a pointgon and p be a vertex o/H with the smallest 
x-coordinate. Denote byp',p" the neighbors ofp in P. Then, for every triangulation 
T ofT\, either 

(1) there exists a non-selfintersecting x-monotone inner path starting from p and con- 
sisting of edges ofT, or 

(2) the three points p , p ' , p " form a triangle ofT. 

The situation in Observation 2 is illustrated in Fig. 2. 

We would like to invoke Observation 2 for our algorithm. 

Let n = (S,P) be a pointgon, and p S Vert(P) a vertex with the smallest x- 
coordinate. A non-selfintersecting x-monotone inner path divides a pointgon into two 
smaller pointgons. (See Fig. 2(a) and recall the general position assumption.) Hence, 
by looking at all non-selfintersecting x-monotone inner paths, we can recursively solve 
the minimum weight triangulation problem. To establish an appropriate recursive for- 
mula, denote by 22 (p) the set that consists of the line segment p'p" and of all non- 
selfintersecting x-monotone inner paths starting from p. Each non-selfintersecting in- 
ner path y G 22 (p) divides our pointgon H into two smaller pointgons, say H.^ and H". 
Then, we can see that 

mwt(H) = min (mwt(n4] + mwt(H") - length(-y)). (1) 

yeT>ip) 
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Fig. 2. Situations in Observation 2. 



To see that Eq. (1) is really true, the following observation should be explicitly 
mentioned, although the proof is straightforward thus omitted. 

Observation 3. Let TT = (S, P) be a pointgon and T be a minimum weight triangula- 
tion o/TT. Choose an inner path y which uses edges ofT only, and letT]' andW" be two 
smaller pointgons obtained by subdividing TT with respect to y. Then, the restriction of 
T toT\' is a minimum weight triangulation ofW. The same holds for TT" as well. 

Therefore, hy solving Recursion (1) with an appropriate boundary (or initial) con- 
dition, we can obtain a minimum weight triangulation of TT. Note that even if TT is 
a convex pointgon, the pointgons TT^ and TT" encountered in the recursion might not 
be convex. Thus, our digression to simple polygons is essential also for the minimum 
weight triangulation problem for a hnite point set, i.e., a convex pointgon. 

3.2 Outline of the Algorithm 

Now, we describe how to solve Recursion (1) with the dynamic-programming tech- 
nique. 

First, let us label the elements of Vert(P) in a cyclic order, i.e., the order follow- 
ing the appearance along the boundary of P. According to this order, let us denote 
Vert(P] = {po,Pi , • • . ,Pn-k-i}- Then, pick a vertex pt G Vert(P), and consider a 
non-selfintersecting x-monotone inner path y starting from pt. Let pj G Vert(P) be 
the other endpoint of y. Note that Vert(Y) \ {pi, Pj) consists of inner points of TT only. 
Therefore, such a path can be uniquely specified by a subset T C S. That is, we asso- 
ciate a triple (i, j , T) with an x-monotone inner path ptqi Uqi qiU • • • U qt-i c|t U qtPj 
where T = {q i , q 2 , • • • , Tt}- For the sake of brevity we write y(T) to denote the inner 
path associated with T when the endpoints pi, Pj are clear from the context. 

For two vertices pi, Pj G Vert(P] on the boundary of TT, and a set T C S of inner 
points, let TT(i, j, T) be the pointgon obtained from TT as follows: the boundary polygon 
is the union ofthe polygonal chains piPi+T andy(T). (Notethatwe only consider 
the case where y(T) is well-dehned, that is, it does not intersect the boundary polygon.) 
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The inner points of n(i, j , T) consist of the inner points of TT contained in the boundary 
polygon specified above. Furthermore, denote by mwt(i, j , T) the weight of a minimum 
weight triangulation of the pointgon 17(1, j,T). Then, Eq. (1) can be rewritten in the 
following way if we take po for the role of p: 

mwt(T7) = 

mini min {mwt(0, i, T) + mwt(i, 0, T) — length (y(T))}, 

1 1 <i<n— k, TCS 

mwt(l,n-k-l,0, null) + mwt(rt— k-1, 1,0, null) — length (pi pn-k-i )| • (2) 

The number of values considered in the right hand side of Eq. (2) is 0((n— k)2'^) = 
0(2'^n). Hence, for the computation of mwt(n] it is sufficient to know mwt(i, j, T) for 
every triple (i, j, T] of two indices i, j G {0, . . . , rt— k— 1 } and a subset TCS. Since 
the number of such triples is 0(2'^n^), the efficient computation of each value results 
in fixed parameter tractability of the minimum weight triangulation problem. 

Nevertheless, to compute these values, we have to generalize the class of pointgons 
under consideration. That is because pointgons we encounter in the recursion might not 
be of the form Tl(i, j, T). Therefore we introduce two additional types of pointgons. 

The pointgon TT{i,j,T) is bounded by two kinds of polygonal chains: a chain 
U£=i PTPi+T from the boundary of the original pointgon and a non-selfintersecting 
x-monotone inner path y(T). Recall that T can be empty. We call such a pointgon a 
type-1 pointgon in the following. See Eig. 3(a) for illustration. 

Another class of pointgons is defined for i, j G {0,n— k— 1}, two disjoint subsets 
T,,T2 C S, and a vertex r G Vert(TT). Then, TT(i, j,T] ,T 2 ,t) is a pointgon bounded by 
the x-monotone path connecting i and r through T] , the x-monotone path connecting 
j and T through T 2 , and lj£=i PiVe+T- (Again we only consider those tuples which are 
well defined, that is, where the paths described above are indeed x-monotone and do not 
cross each other.) We call such a pointgon a type-2 pointgon of IT, and divide them into 
two subclasses according to whether r is a convex (type-2a) or reflex (type-2b) vertex 
of the pointgon. Fig. 3(b) & 3(c) illustrate the dehnition. 

The last kind of pointgons uses at most one vertex of P. For a vertex r G Vert(T7) 
and two subsets T] , T 2 C S with T] n T 2 = {s}, we define the pointgon TT(Ti , T 2 , r) 
as one which is bounded by two x-monotone paths connecting r and s through Ti and 
through T 2 , respectively. We call such a pointgon a type-3 pointgon of IT. See Fig. 3(d) 
for an example. 

Let us count the number of these pointgons. The number of type-1 pointgons is 
0(2’^n^]; the number of type-2 pointgons is 0(3'^n^); the number of type-3 point- 
gons is 0(3'^n). Therefore, the total number of these pointgons is 0(3'^n^). Our goal 
in the following is to compute the weights of minimum weight triangulations of these 
pointgons efficiently. Denote by mwt(i, j,T) the weight of a minimum weight trian- 
gulation of a type-1 pointgon 17(1, j,T). Similarly, we dehne mwt(i, j, Ti , T 2 , r) and 
mwt(T] , T 2 , r) for type-2 and type-3 pointgons, respectively. 

Before describing the algorithm, let us discuss why we only encounter these three 
types of pointgons in the recursion. Eor this, we have to be careful which vertex to 
choose as p in the recursion step. Recall that in any step of Recursion (1) there are two 
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(a) A type-1 pointgon. (b) A type-2a pointgon. 




(c) A type-2b pointgon. 




Fig. 3. The three types of subpointgons of TT. The vertex r can be either a solid or an 
empty circle. 



cases: either p is cut off by joining its neighbors by an edge, or the pointgon is subdi- 
vided by an x-monotone inner path starting from p. Also recall that in Observation 2 we 
required p to be the leftmost point of the pointgon. If we apply the same argument as 
in Observation 2 to an arbitrary vertex of the pointgon, in the second case there appears 
an inner path starting from p that is almost x-monotone, i.e., x-monotone except for the 
first edge incident to p. 

Initially we have a given pointgon Tf = (S, P) and choose the leftmost vertex as p. 
If p is cut off (Fig. 4(a)) the result is a type-1 pointgon where T = 0. Any x-monotone 
inner path starting from p divides the pointgon into two type-1 pointgons (Fig. 4(b)). 

When we apply Recursion (1) to a type-1 pointgon fT(i, j,T), we choose as p the 
leftmost vertex of the inner path y (T) (which might consist just of a single edge joining 
Pi and pj). If p is cut off, the result is either again a type-1 pointgon (Fig. 5(a)) or a 
type-2a pointgon (Fig. 5(b)). Otherwise, consider the vertex q on y(T] next to p. In 
every triangulation, the edge p^ must belong to some triangle. To make such a triangle 
we need another vertex, say z. Let us choose z to be such that pz is the first edge of an 
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Fig. 4. Subdivisions obtained from IT. From now on in the pictures, the vertex p is 
indicated by a larger circle. 



almost x-monotone inner path y' starting from p. If z G Vert(P), then we get a type-1 
pointgon, the triangle pqz and a type-2a pointgon when z is right of p (Fig. 5(c)), or a 
type-1 pointgon, the triangle pqz and a type-1 pointgon when z is left of p (Fig. 5(d)). 
If z G S, then we have four subcases. When z is right of p and y' ends at a vertex 
of y(T), we get a type-1 pointgon, the triangle pqz and a type-3 pointgon (Fig. 6(a)). 
When z is right of p and y ' ends at a vertex of P, we get a type-1 pointgon, the triangle 
pqz and a type-2a pointgon (Fig. 6(b)). When z is left of p and y' ends at a vertex of 
y(T), we get a type-2b pointgon, the triangle pqz and a type-3 pointgon (Fig. 6(c)). 
When z is left of p and y ' ends at a vertex of P, we get a type-2b pointgon, the triangle 
pqz and a type-2a pointgon (Fig. 6(d)). 

When we apply Recursion (1) to a type-2a pointgon 17(1, j,T] ,T 2 ,r), we choose r 
as p. If p is cut off the result is either again a type-2a pointgon or a type-1 pointgon 
(Fig. 7(a)). Otherwise, consider an x-monotone inner path starting from p. If the path 
ends at a vertex of P, we get two type-2a pointgons (Fig. 7(b)). If, on the other hand, 
the inner path ends at a vertex in S, then it subdivides the pointgon into a type-2a and a 
type-3 pointgons (Fig. 7(c)). 

When we apply Recursion (1) to a type-2b pointgon, we choose as p the leftmost 
vertex of the inner path. Since p is a reflex vertex, p cannot be cut off. So, every x- 
monotone inner path starting from p subdivides the pointgon into two type-1 pointgons 
(Fig. 7(d)). 

When we apply Recursion (1) to a type-3 pointgon n(Ti ,T 2 ,t), we choose r as 
p. Then, no matter how we divide the pointgon by the operations in the recursion, the 
result is again a type-3 pointgon (Fig. 8). 

So much for preparation, and now we are ready to give the outline of our algorithm. 

Step 1: Enumerate all possible type-1 pointgons TT(i,j,T), type-2 pointgons 
n(i, j,Ti ,T 2 ,t), and type-3 pointgons TT(Ti ,T 2 ,t). 

Step 2: Compute the values mwt(i,j,T), mwt(i,i,T] ,T 2 ,t), and mwt(T],T 2 ,r) for 
some of them, which are sufficient for Step 3, by dynamic programming. 

Step 3: Compute mwt(n) according to Eq. (2). 
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(a) The vertex p is cut off and a neigh- 
bor of p is left of p. 



(b) The vertex p is cut off and the 
neighbors of p are right of p. 




(c) The vertex z is a solid circle and z 
is right of p. 



(d) The vertex z is a solid circle and z 
is left of p. 



Fig. 5. Subdivisions obtained from a type-1 pointgon. 



We already argued that Step 3 can be done in 0(2’^n] time. In the next section we 
will show that Steps 1 & 2 can be done in 0(6'^n^ logn) time, which dominates the 
overall running time. 



3.3 Dynamic Programming 

Now, we are going to compute the values of mwt(i, j,T), mwt(i, j,T] ,T2 ,t), and 
mwt(T] , T2, r) for all possible choices of i, j, T] , T2, r. 

First we enumerate all possibilities of i, j , T] , T2 , r. Each of them can be enumerated 
in 0(1 ) time, and each of them can be identified as a well-defined pointgon or not (i.e., 
the inner paths do not intersect each other nor the boundary) in 0(n log n) time. (Apply 
the standard line segment intersection algorithm [15].) Therefore, it takes 0(3'^n^ • 1 • 
nlogn) = 0(3'‘n'* logti] time. This completes Step 1 of our algorithm. 

Then, we perform the dynamic programming. Determine the vertex p and consider 
all possible subdivisions with respect to p as described in the previous section. Each 
subdivision replaces FT by two smaller pointgons. Then, as we saw, these two pointgons 
can be found among those enumerated in Step 1 . 

We can associate a parent-child relation between two pointgons FTi , Ff2 in our enu- 
meration: TTi is a parent of 112 if 02 is obtained as a smaller pointgon when we partition 
111 by a path starting from p (which is fixed as in the previous section) or through the 
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(a) The vertex z is right of p and the 
path y ' ends at an empty circle. 



(b) The vertex z is right of p and the 
path y ' ends at a solid circle. 




(c) The vertex z is left of p and the 
path y ' ends at an empty circle. 



(d) The vertex z is left of p and the 
path y ' ends at a solid circle. 



Fig. 6. Subdivisions obtained from a type-1 pointgon (continued). The vertex z is an 
empty circle. 



edge cutting off p. It can also be thought as defining a directed graph on the enumerated 
pointgons: namely, draw a directed edge from Tli to Tli if the same condition as above 
is satisfied. 

Observe that if TT] is a parent of IT 2 , then the number of inner points in IT 2 is less 
than that in TTi or |Ti | + IT 2 I is smaller in T12 than in TTi . Therefore, the parent-child re- 
lation is well-defined (i.e., there is no directed cycle in the directed-graph formulation). 

Now, to do the bottom-up computation, we first look at the lowest descendants (or 
the sinks in the directed-graph formulation). They are triangles. So, the weights can be 
easily computed in constant time. Then, we proceed to their parents. For each parent, 
we look up the values of its children. In this way, we go up to the highest ancestor, 
which is IT. Thus, we can compute mwt(n). 

What is the time complexity of the computation? First, let us estimate the time 
for the construction of the parent-child relation. The number of enumerated pointgons 
is 0(3’^n^). For each of them, the number of possible choices of non-selfintersecting 
x-monotone paths is 0(2'^n). For each of the paths, we can decide whether it really 
defines a non-selfintersecfing path in O(nlogn) time. Therefore, the overall running 
time for the construction is 0(3'^n^ • 2^n ■ nlogn) = 0(6'^n^ logn). 

In the bottom-up computation, for each pointgon we look up at most 0(2’^n] entries 
and compute the value according to Eq. (1). Therefore, this can be done in 0(3'^n^ • 
2^n) =0(6'^n4). 



210 



Michael Hoffmann and Yoshio Okamoto 





(b) Type-2a. The inner 
path ends at a solid 
circle. 




(c) Type-2a. The inner 
path ends at an empty 
circle. 




Fig. 7. Subdivisions obtained from a type-2 pointgon. 



Hence, the overall running time of the algorithm is 0(3'^n'* logrt -|- logn -1- 
6^ri*) = 0(6'^n^ logn). This completes the proof of Theorem 1. 

4 Conclusion 

In this paper, we studied the minimum weight triangulation problem from the viewpoint 
of fixed-parameter tractability. We established an algorithm to solve this problem for a 
simple polygon with some inner points which runs in 0(6'^n® log n) time when n is the 
total number of input points and k is the number of inner points. Therefore, the problem 
is fixed-parameter tractable with respect to the number of inner points. We believe the 
number of inner points in geometric optimization problems plays a role similar to the 
treewidth In graph optimization problems. 
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(a) The vertex p 
is cut off. 



(b) An inner 
path from p. 



Fig. 8. Subdivisions obtained from a type-3 pointgon. 



Since our algorithm is based on a simple idea, it can be extended in several ways. 
For example, we can also compute a maximum weight triangulation in the same time 
complexity. (It seems quite recent that attention has been paid to a maximum weight 
triangulation [10,16].) To do that, we just replace “min” in Eqs. (1) and (2) by “max.” 
Another direction of extension is to incorporate some heuristics. For example, there 
are some known pairs of vertices which appear as edges in all minimum weight trian- 
gulations, e.g. the |3-skeleton for some |3 and the LMT-skeleton; see [4,5,17] and the 
references therein. Because of the flexibility of our algorithm, we can insert these pairs 
at the beginning of the execution as edges, and proceed in the same way except that we 
can use the information from the prescribed edges. 

The framework proposed in this paper looks promising when we deal with the 
complexity of geometric problems concerning a finite point set on the plane. Study 
of another problem within the same framework is interesting. Recently, the traveling 
salesman problem was considered and it was shown that the problem can be solved in 
polynomial time when k = O(logn) [6]. 

The obvious open problem is to improve the time complexity of our algorithm. For 
example, is it possible to provide a polynomial-time algorithm for the minimum weight 
triangulation problem when k = 0(log^ n)? 
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Abstract. With respect to a given plane graph, G, a face cover is de- 
fined as a set of faces whose boundaries collectively contain every ver- 
tex in G. It is known that, when k is fixed, finding a cover of size k 
(if indeed any exist) can be accomplished in polynomial time. Recent 
improvements to face cover algorithms are based on the theory of fixed- 
parameter tractability and reductions to planar dominating set. A major 
goal has been to reduce the time required for branching, which is the most 
computationally-intensive aspect of fixed-parameter tractable methods. 
The fastest previously- known method for solving planar dominating set 
requires branching time 0(8^n). The main contribution of this paper is 
a direct and relatively simple 0(5^ n) face cover branching algorithm. A 
direct O(n^) face cover kernelization algorithm is also presented. 



1 Introduction 

A set of faces whose boundaries contain all vertices in a plane graph G is said 
to be a face cover for G. For arbitrary k > 0, determining whether G has a face 
cover of size k or less is AfT^-complete [4] . Algorithms are known that solve the 
face cover problem in 0{c^n) time [2, 4]. Thus, the problem is fixed-parameter 
tractable (FPT) [5]. 

Algorithms for FPT problems very often use the following two steps: (i) 
kernelization, which reduces the problem size to a function of the parameter only 
and (ii) branching, which exhaustively searches for a solution in a tree whose 
height is bounded above by the parameter (or perhaps, say, a linear function of 
the parameter). Our work further illustrates this methodology, and elucidates 
how the planarity of a graph can be exploited in this context. 
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Let us use FC{k) to denote the face cover problem with parameter k. Re- 
cent FC{k) algorithms use a reduction to the parameterized planar dominating 
set problem (PDS{k)). The two most recent PDS{k) algorithms are based on 
different approaches. One uses dynamic programming on tree decompositions, 
and can be used to solve FC{k) in 0{c^), where c is a very large constant (see 

[2] , [7], and [8])^. Another addresses branching only (without kernelization) , and 
for that phase runs in 0(8^n) time [3]. Although face cover was not mentioned 
in [3], it is believed that an FC{k) branching algorithm would follow from this 
work, and should have a run time of 0(8^n) [6]. Although the first algorithm has 
a better asymptotic time bound, an algorithm with a run time of 0(8^n) would 
be preferred in practice. To see this, note that face cover has a highly practical 
linear-time algorithm when k = 1, since this is just the recognition problem for 
outerplane graphs. Even for k = 2, however, an 0(3^®V^^^^^n -|- n^) algorithm 
does not seem very encouraging. 

The main purpose of an effective branching algorithm is to achieve a low 
branching factor in the search tree used to explore the kernel exhaustively. The 
method presented in [3] tries to achieve this objective by exploiting the existence 
of low degree vertices in a planar graph. For a planar graph G, S(G) < 5, which 
implies that we can always find a vertex that belongs to the boundaries of at 
most five faces. So, if we use the search tree technique, branching at such vertex 
introduces at most five “smaller” instances of the problem. This suggests that 
any branching algorithm for face cover should attempt to attain a (possibly 
best possible) branching factor of 5. The challenge we face is illustrated by the 
following question: after performing the first (or first few) branch operation(s), is 
there any guarantee that another vertex of degree < 5 is present in the resulting 
graph? The answer is typically “no” (as in the case of the PDS algorithm of 

[3] ). We shall show, however, that certain reduction rules can be used to modify 
the graph and always guarantee (in the worst case) the existence of a vertex 
belonging to no more than five faces that qualify for membership in the cover. 
With this we devise a direct face cover branching algorithm that runs in 0(5^n) 
time and guarantees a branching factor of 5. We also present a kernelization 
algorithm that runs in O(n^) time. 

Our algorithm is fairly simple to implement, but its analysis is rather in- 
tricate. Most of this paper is involved with the proofs of correctness and time 
complexity analysis. The notation G = {V, F) is used when referring to a plane 
graph. For a face f & F, the set of vertices appearing in the ordered tuple asso- 
ciated with / is called the boundary of / and is denoted by V{f). For a vertex 
V GV, the set of faces whose boundaries contain v is denoted by F{v). 

Face cover is a special case of the hitting set problem (HS). To see this, note 
that a face cover of G = (V,F) is a hitting set of the sets {F{u) : u & V}. 
Unfortunately, HS{k) is not FPT unless we fix the number of elements allowed 
in each of the input sets. This restricted form of HS{k) is denoted by d-P[S{k), 
where d is the (fixed) upper bound on the size of input sets (see [10]. Thus, a 
d-HS algorithm can be used to solve the face cover problem when the degree of 



® c was 3®® in [2]. 
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any vertex of the input plane graph is bounded by d. We shall, however, deal 
with the general face cover problem. Moreover, preprocessing techniques used 
for the general HS algorithm in [12] apply well to FC{k) instances. 



2 Preliminaries 



We shall assume that our input is like the input of a, HS algorithm. In fact, this 
is why we chose to characterize plane graphs by vertices and faces. The data 
structure used for a plane graph consists, essentially, of two lists corresponding 
to vertices and faces of the graph. The list associated with a vertex v starts with 
the number of faces in F{v) then a list of the indices of these faces. Similarly, the 
list associated with face / contains the number of vertices in V (/) followed by the 
indices of elements of V (/). Figure 1 shows a plane graph and the corresponding 
input to our FC{k) algorithm. 



fo 




F(O): 3; 0,1,2 


V(0): 4; 0,1,2,3 


F(l): 2; 0„2 


V(l): 4; 0,4,2,3 


F(2): 3; 0,1,2 


V(2): 4; 0,1,2,4 


F(3): 2; 0,1 




F(4): 2; 1,2 





Fig. 1. A plane graph and the corresponding input representation 



The number of vertices of the given plane graph is denoted by n. When 
dealing with graphs, the size of input is often quadratic in n. This is due to the 
popular adjacency matrix representation. In our case, the graph is plane and, 
thanks to Euler’s formula, has a linear number of edges and faces. We show that, 
according to our representation, the size of input is linear in n. 

Lemma 1. Let G = (V,F) be a given plane graph, and let V{f) and F{v) be 
as defined above. Then the sets {F{v) : u G F} and {V{f) : f G F} have linear 
size. 

Proof. We fisrt note that |{F(u) : v G V}\ = \{V{f) : f G F}|. To see this, 
consider the bipartite graph H = {A, B) whose vertex sets A and B are the 
vertices and faces of G respectively. An edge of H connects a vertex u of A 
to a vertex f of B if and only if / G F{v) (if and only if u G V{f)). Thus 
we have \E{H)\ = |{F(z;) : v G V}\ = \{V{f) : f G F}|. Moreover, for each 
vertex v of G, |F(u)| is bounded above by the degree of v in G. It follows that 





216 Faisal N. Abu-Khzam and Michael A. Langston 



|{F(z;) : u G y}| < Sy^ydoiv) = 2|_E(G)| < 2(3n — 6)^. This proves that \E{H)\ 
is of linear size, which completes the proof. 

We use the bounded search tree technique in our FC{k) algorithm. During 
the search process, the vertex set is partially covered by the already selected 
faces. We shall, then, reduce the graph at each step by eliminating covered 
vertices. While this action is easy (and safe) when dealing with a general HS 
instance, it must be performed carefully in our case. Especially because we need 
to be assured that every node in the search tree has at most 5 children. More- 
over, deleting a covered vertex from a plane graph might change the size of an 
optimal face cover and produce incorrect results. As an example, consider the 
graph shown in Figure 2. If the algorithm selects the outer face and deletes all 
its vertices, the resulting graph has a face cover of size one, while the original 
graph has a minimum face cover of size three. Our algorithm deals carefully with 
covered vertices. Before deleting a covered vertex, v, we modify all faces contain- 
ing it by, simply, deleting v from their (ordered) lists. This face compaction will 
be called “the surgical operation” later in this paper. It is depicted in Figure 3. 

2.1 Annotated Plane Graphs Representation 

For a given plane graph G = (V, F), vertices and faces are of two types: active 
and marked. Active vertices are those to be covered (i.e., not covered yet) and 
active faces are those that can be used to cover active vertices. A plane graph 
with active and marked vertices and/or faces will be called annotated in this 
paper. 

A general version of the face cover problem was presented in [4], where not 
all vertices of the graph are to be covered. Our algorithm will be ready to deal 
with this version as well. In fact, if a vertex is not to be covered, then we may 
assume that it has been covered earlier during the process of finding the face 
cover and it will be marked. 




Fig. 2. Deleting the vertices that are covered by selecting the outer face produces 
a wrong answer. 



Marking a face / is done by deleting V (/) after replacing the index of / by 
-1 in the list(s) F(v) of each vertex v € V(f). This process takes 0(|{F(t>) : v € 
F(/)}|). Thus it is 0(n). We will refer to this procedure by MARK-FACE(f). 



It is known from [1] that, if G is a yes instance of FC{k) then |F(G)| < 2n — 3k + 6. 



4 




A Direct Algorithm for the Parameterized Face Cover Problem 217 



Similarly, marking a vertex v is done by deleting F(v) after replacing the 
index of v by -1 in the list(s) V{f) of each / G F{v). This procedure, denoted 
by MARK-VERTEX{v), takes time 0{\{V{f) : f G T’(f)}|) which is 0(n). 
To show that such simple operation is sound, we prove it to be equivalent to a 
surgical operation on the graph. 





Fig. 3. (a) Before selecting face /. (b) After selecting /. The circled vertices are 
marked. For simplicity, edges between marked vertices are not contracted in this 
figure. 



2.2 A Surgical Operation 

Two neighbors, u and w, of a vertex, v, are consecutive if some face, /, contains 
the three vertices such that one if the two ordered tuples (u,u,w) and {w,v,u) 
is a sub-tuple of the ordered tuple of /. 

If a vertex, v, is of degree at least 2, and is marked, then active faces that 
are adjacent to it will not be needed to cover it. Deleting v could lead to wrong 
answers as shown in Figure 2. So, prior to removing v, we make sure that the 
marking operation avoids such cases. The marking operation simply consists of 
(1) contracting edges between v and all marked neighbors of v, then (2) adding 
edges between consecutive active neighbors of v (even if they are adjacent) and 
marking all faces that contain v in the resulting graph. This action is safe in the 
following sense: every face that is adjacent to v and is used to cover a neighbor 
of V, must contain two consecutive neighbors of v. Thus, adding an edge between 
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consecutive neighbors preserves active faces that might later be needed to cover 
neighbors of v (and possibly other vertices). We refer to this operation on the 
graph by the surgical operation. In our implementation of the algorithm, we did 
not need to perform the surgical operation. It is used only to show that the 
algorithm has the claimed performance and the operations of marking vertices 
and faces are sound. 

Notice also the case where only one vertex of a face is left active at a certain 
point. In this case, the face is still active unless the vertex belongs to some other 
face. We shall see that preprocessing rules detects this case and deals with it, 
leaving no such faces in the graph. Figure 3, which shows (a snapshot of) the 
effect of the surgical operation, depicts some pendant faces. 

3 Preprocessing 

For simplicity, we assume the given graph is connected. Due to the chosen input 
representation, however, our algorithm works in both cases. Our preprocessing 
consists of two main rules: 

The Dominated Face Rule: If two faces, / and /', of G are such that V (/) C 
V{f), then / is said to be dominated by /'. In such case, / can be marked 
since every face cover of G that contains / can be replaced by a (possibly 
better) cover that contains /'. 

The Dominated Vertex Rule: If two vertices, u and v, are such that F{u) C 
F{v), then v is said to be dominated by u, and v is marked since any face 
that covers u will cover v. 

Checking if a vertex (face) is dominated is done by a search through all the 
sets of {F{v) : V &V} (|V(/) : / G F’}). It thus takes linear time. It follows that 
checking if the graph has dominated vertices or faces can be accomplished in 
O(n^). Each dominated vertex or face is then marked in 0(n) time. Therefore, 
the total run time of preprocessing is O(n^). We shall refer to this preprocessing 
algorithm by procedure PREPROCESS later in this paper. 

The above two rules generalize many simple preprocessing techniques. For 
example, let P be a path in G whose length exceeds two. If all interior vertices 
of P are of degree two, then we can reduce the problem size by contracting all 
interior edges of P (edges between interior vertices), thus keeping one (neces- 
sary) interior vertex of degree two. This turns out to be a special case of the 
dominated vertex rule since two interior degree-two vertices have the same set of 
active faces. Another example is the pendant vertex rule, which simply adds the 
unique face containing a pendant vertex to the cover. Such rule is automatically 
accounted for, since it is also a special case of the dominated vertex rule (the 
unique neighbor of a pendant vertex is dominated) . 

4 Kernelization 



This section is devoted to the proof of the following theorem. 
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Theorem 1. Let G = (V,F) be an instance of FC{k). There is an algorithm 
that runs in O(n^) and produces another, instance G' = {V , F'), of FG(k') such 
that: 

(i) G is a yes instance of FG{k) G' is a yes instance of FC{k'), 

(ii) k' < k, and 

(hi) |y(G')| < 2 k'^. 



Lemma 2. Three or more vertices of a plane graph, G, may not be common to 
more than two faces of G. 

Proof. Assume vertices ui,U2, and U3 belong to three faces, /i, /2, and /s. Con- 
struct another plane graph, G', by drawing three vertices, vi,V2, and V3 such 
that vertex Vi is placed inside face fi and joined to all vertices that lie on the 
boundary of fi. The subgraph of G' induced by vertices {Mi}?=i and {ui}i=i is 
isomorphic to K3 3. This is a contradiction. 

Lemma 3. Let G be a yes instance of FC{k). If two faces of G have more than 
2k common vertices, then any face cover of size k contains at least one of them. 

Proof. Let G be a face cover of size k. Assume faces fi and /2 contain 2fc -I- 1 
common vertices and are not elements of G. Then, by the pigeon hole principle, 
some element of G must cover at least three of the 2k +1 common vertices of fi 
and /2. This is impossible by lemma 2. 

Corollary 1. Let G be a yes instance of FG{k). If no pair of faces of G have 
2k + 1 common vertices, then every face whose length exceeds 2k^ is in any 
optimal face cover of G. 

Corollary 2. Let G be a yes instance of FC{k). If f G F has more than 2k 
vertices with more than k faces, then / must be in any optimal face cover of G. 

We now describe the kernelization process and use it in a proof of Theorem 
1. This process is referred to, later, by procedure KERNELIZE. 

The first step in KERNELIZE is a simple search for all faces of length 
> 2/c'^, where k' is originally equal to k. This is done in 0(n) time since the 
length of a face is captured when reading input. All faces that are found whose 
length exceeds 2k'^ are kept in a list, L. 

The second step is the following: For each face f G L, the number of common 
vertices with all other faces of L is computed. If a face /' has more than 2k' 
common vertices with / then their common vertices are all added to a list M 
of vertices that are to be marked®, and a virtual new vertex, v, is added to the 
list of vertices such that E{v) = {/,/'}. This operation is equivalent to adding 
a (virtual) degree-two vertex that dominates all common vertices of / and f . 
If no such face f exists, then, because of corollary 2, / is added to a list G of 



® Note that, by the surgical operation, active faces do not have marked vertices on 
their boundaries. 
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faces that are in the cover and are to be marked. Moreover, if the number of 
faces that share more than 2k' common vertices with / exceeds k' , then / is also 
selected in the cover and, thus added to list C. Otherwise, more than k' faces 
will have to be in the cover. Note that, because of Lemma 3, we stop the search 
if more than k' disjoint pairs of faces have (at least) 2k' + 1 common vertices. 

The last step deals with a cleanup of the lists and an update of the parameter: 
vertices in M are marked, faces in C are marked together with their boundary 
vertices, and the parameter k' is replaced by fc — \C\ {k is unchanged while C, 
being originally empty, is filled with cover faces) . Then list L is emptied and the 
process is repeated from step one until no more faces of length > 2k' are found 
(or no solution can be found). 

Proof of Theorem 1: 

Algorithm KERNELIZE just described satisfies the three conditions of the 
theorem. In particular, condition (iii) follows from corollary 1. As for the time 
complexity, we note the following: 

(i) Step one takes 0{n) since it’s a simple filling of list L by a one pass 
through the list of active faces. 

(ii) Step (ii) takes 0{kn): for each face f G L, we either find another face 
f that shares 2k' + 1 vertices with / or we add / to C . By Lemma 3, no more 
than k pairs of faces can share 2fc + 1 vertices, and no more than k faces can be 
added to C. Hence, the number of iterations in step two is at most k throughout 
the whole process. 

(iii) The last (cleanup) step takes time O(n^) since it deals with marking 
vertices and faces that are in M and C respectively. The proof of Theorem 1 is 
now complete. 



5 A Direct Face Cover Algorithm 



Our direct algorithm is represented by the procedure F ACECOVER shown 
below. Subroutines PREPROCESS, KERNELIZE and MARK-EACE cor- 
respond (obviously) to the processes previously described in detail. 

We shall prove that, at every call to EACECOV ER, the selected vertex, v, 
has no more than five active faces in its F{v) list. We know that the first call is 
guaranteed to select such vertex. As a remark, we note that, at least three such 
vertices are present in the graph. This is guaranteed by virtue of Euler’s formula 
and the following lemma, which first appeared as a corollary in [11]. 

Lemma 4. If G is a planar graph, then G has at least three vertices of degree 
< 5. 



Proof. Let m = |M = 

d{v) < 5}|. Then: 3n - 
Ev.Ld(v)+6("-b . ^ 

9 — 



{u G V{G) : d{v) > 5} I and I = 



- 6 > e(G) > 



E..Ld(v)+E. 



,d(v) 



d(v) 



+ 3n — 31. Therefore I > 2 + 



\L 

> 



= {t: G V{G) : 



E..Ld(v) 

6 



> 2 . 
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Procedure F ACECOVER 

Input: A plane graph G = (V,E) given by {F{u) : m G P} and {V{f) : f G 
F}, and an integer fc > 1. 

Output: A face cover, C, of size < A: of G if one exists. NULL otherwise. 

Begin procedure 
k' 

(G, G, k') ^ PREPROCESS{G, k') 

(G, G, k') ^ KERNELIZE{G, k') 

Select an active vertex v such that |f’(u)| = min{F{u)\u is active in U} 
For every / G F{v) do 
Gi ^GU{/}; 

Gi ^ MARK-FACE(f); 

(Gi,Gi,A:') ^ KERNELIZE{Gi,k' - 1); 

Ga ^ FAGECOVER{Gi,k')] 
if(Ga yf NULL) 
return Gi U Ga 
return NULL 
End procedure 



If a vertex, v, is the only active vertex of face /, then / will only be selected 
(and marked) if v doesn’t belong to any other face. Otherwise, it would be 
dominated (thus marked). We can, therefore, assume that every active face has 
at least two active vertices. 

Faces of length two may exist due to the surgical operation which could 
introduce multiple edges between two vertices. This case is easily handled by 
KERNELIZE since two vertices cannot belong to more than one face of length 
two (by the dominated face rule). 

Lemma 5. Let v be an active vertex of an annotated plane graph, G. Then no 
marked neighbor of v belongs to an active face of v. 

Proof. The lemma follows immediately from the surgical operation. 

Theorem 2. P ACEGOV ER runs in 0(5^ + ’nf) time. 

Proof. At each call to FAGECOVER, the (plane) subgraph induced by active 
vertices of Gi must have a vertex, v, of degree < 5. By Lemma 5, the active 
faces in F(v) are faces that are common to v and its active neighbors. Thus 
the number of such active faces would exceed five only if v has multiple edges 
with at least one of its neighbors. Which means that v belongs to faces of length 
two. However, a face of length two will only be active if it’s the unique face that 
is common to its two vertices. This proves that each node in the search tree 
has at most five children. Having no more than k levels, the search tree has at 
most 0(5^) nodes. Thus pure branching would take 0{b^kf) (after the (O(n^) 
kernelization). since interleaving is used as in [9], the run time of branching 
reduces to 0(5^ + kf) and the overall run time is (5^ + k^ + ’nf). This completes 
the proof. 
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6 Remarks 

As a preliminary test of efficiency, we have implemented our algorithm and tested 
it on random plane graphs of size up to 200. Answers were obtained in at most 
a few seconds. We plan larger and more systematic testing in the near future. 
We know of no other FC{k) implementations, but would be most interested in 
conducting extensive experiments and comparisons. 
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Abstract. We consider the parameterized problems of whether a given 
set of clauses can be refuted within k resolution steps, and whether a 
given set of clauses contains an unsatisfiable subset of size at most k. We 
show that both problems are complete for the class W[l], the first level 
of the W-hierarchy of fixed-parameter intractable problems. Our results 
remain true if restricted to 3-SAT formulas and/or to various restricted 
versions of resolution including tree-like resolution, input resolution, and 
read-once resolution. 

Applying a metatheorem of Frick and Grohe, we show that restricted to 
classes of locally bounded treewidth the considered problems are fixed- 
parameter tractable. Hence, the problems are fixed-parameter tractable 
for planar CNF formulas and CNF formulas of bounded genus, fc-SAT 
formulas with bounded number of occurrences per variable, and CNF 
formulas of bounded treewidth. 



1 Introduction 

Resolution is a fundamental method for establishing the unsatisfiability of a given 
formula in Conjunctive Normal Form (CNF) using one single rule of inference, 
the resolution rule. This rule allows to infer the clause CUD from clauses CU{a;} 
and D U {^a;}. A CNF formula is unsatisfiable if and only if the empty clause 
can be derived from it by repeated application of the resolution rule. Resolution 
is easy to implement and provides the basis for many Automated Reasoning 
systems. 

It is well known that certain unsatisfiable CNF formulas require an expo- 
nential number of resolution steps in order to be refuted [11]. Iwama [12] shows 
that, given a CNF formula F together with an integer k, deciding whether F 
has a resolution refutation with at most k steps is NP-complete. This result 
is strengthened by Alekhnovich et al. [2] by showing that the minimum num- 
ber of resolution steps cannot be approximated within a constant factor, unless 
P = NP (this result also holds for stronger proof systems like Frege systems). 
A closely related question is the “automatizability” of resolution: is there an 
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R. Downey, M. Fellows, and F. Dehne (Eds.): IWPEC 2004, LNCS 3162, pp. 223-234, 2004. 
@ Springer- Verlag Berlin Heidelberg 2004 




224 Michael R. Fellows, Stefan Szeider, and Graham Wrightson 



algorithm that finds a shortest resolution refutation R in polynomial time w.r.t. 
the number of steps in R1 Alekhnovich and Razborov [3] show that resolution is 
not automatizable, assuming a parameterized intractability hypothesis regard- 
ing W[P]. For a survey of further results on the complexity of resolution, see, 
e.g., Beame and Pitassi [4] or Clote and Kranakis [6]. 

Parameterizing by the number of steps of a resolution refutation is of rele- 
vance if one has to deal with large CNF formulas which contain local inconsis- 
tencies. Evidently, one can use exhaustive search for finding a k-step resolution 
refutation of a CNF formula with n variables, yielding a time complexity of 
nP(k) ^ However, even if /c is a small integer, say k = 10, exhaustive search be- 
comes impractical for large n. The question rises whether one can find resolution 
refutations with a fixed number of steps significantly more efficient than by ex- 
haustive search. The framework of parameterized complexity [8] offers a means 
for addressing this question. Here, problems are considered in two dimensions: 
one dimension is the usual size n of the instance, the second dimension is the 
parameter (usually a positive integer k) . A parameterized problem is called fixed- 
parameter tractable (or fpt, for short) if it can be solved in time f{k) ■ for 

some computable function / of the parameter. The parameterized complexity 
classes W[l] C W[2] C • • • C W[P] contain problems which are believed to be not 
fpt (see [8]); since all inclusions are believed to be proper, the hierarchy provides 
a means for determining the degree of parameterized intractability. A parame- 
terized problem P fpt reduces to a parameterized problem Q if we can transform 
an instance (x, k) of P into an instance (x',g{k)) of Q in time f{k) ■ {f,g 

are arbitrary computable functions), such that (x,k) is a yes-instance of P if 
and only if {x' ,g{k)) is a yes-instance of Q. 

As a main result of this paper, we show that SHORT resolution refuta- 
tion, that is, refutability within k resolution steps, is complete for the class 
W[l]. We also show that this result holds true for several resolution refinements 
including tree-like resolution, regular resolution, and input-resolution. We es- 
tablish the hardness part of the result by an fpt-reduction of the parameterized 
clique problem. As it appears to be difficult to establish W[l]-membership by 
reducing the problem to the canonical W[l]-complete problem on circuit satisfi- 
ability, we use results from descriptive parameterized complexity theory. 

We show that refutability within k resolution steps can be expressed as a 
statement in positive (i.e., negation-free and V-free) first-order logic. This yields 
W[l]-membership as it was shown by Papadimitriou and Yannakakis [16] in the 
context of query evaluation over databases, that the evaluation of statements in 
positive first-order logic over finite structures is W[l]-complete. 

Along these lines, we also show W[l]-completeness of small unsatisfiable 
SUBSET, that is, the problem of whether at most k clauses of a given CNF for- 
mula form an unsatisfiable formula. Furthermore, we pinpoint that all our W[lj- 
completeness results remain valid if the inputs are confined to 3-CNF formulas. 

The notion of hounded local treewidth for classes of graphs (see Frick and 
Grohe [10]) generalizes several graph classes, like planar graphs, graphs of 
bounded treewidth, or graphs of bounded degree. By means of incidence graphs 
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(see Section 2.1) we can apply this notion to classes of CNF formulas. Special 
cases are planar CNF formulas (CNF formulas with planar incidence graphs) 
and of {k, s)-CNF formulas (CNF formulas with k literals per clause and at 
most s occurrences per variable). Frick and Grohe [10] show that the evaluation 
of first-order statements over classes of graphs with locally bounded treewidth 
is fixed-parameter tractable (the result holds also for finite structures whose 
Gaifman graphs have locally bounded treewidth). Applying this powerful result, 
we obtain fixed-parameter tractability of SHORT resolution refutation and 
SMALL UNSATISFIABLE SUBSET restricted to classes of CNF formulas with locally 
bounded treewidth. Thus the problems are tractable for planar CNF formulas 
and for (fc, s)-CNF formulas. 

Note that satisfiability is NP-complete for planar CNF formulas (Lichten- 
stein [15]) and (3,4)-CNF formulas (Tovey [18]), and even for the intersection of 
these two classes (Kratatochvfl [13]). However, satisfiability of CNF formulas of 
(globally) bounded treewidth is fixed-parameter tractable (Courcelle et al. [7], 
see also Szeider [17]). 



2 Preliminaries and Notation 

2.1 CNF Formulas 

A literal is a propositional variable a; or a negated variable we also write = 
X and x^ = ~^x. A clause is a finite set of literals not containing a complementary 
pair X, ^x. A formula in conjunctive normal form (or CNF formula, for short) 
F is a finite set of clauses. F is a k-CNF formula if the size of its clauses is at 
most fc; F is a {k, s)-CNF formula if, additionally, every variable occurs in at 
most s clauses. The length of a CNF formula F is defined as 1^1- ^ 

CNF formula F, var{F) denotes the set of variables x such that some clause of F 
contains x or —~x. A literal a;® is a pure literal of F if some clauses of F contain 
x^ but no clause contains x^~^. F is satisfiable if there exists an assignment 
T : var{F) {0, 1} such that every clause of F contains some variable x with 
t{x) = 1 or some negated variable ~^x with t{x) = 0; otherwise, F is called 
unsatisfiable. F is called minimal unsatisfiable if F is unsatisfiable and every 
proper subset of F is satisfiable. Note that minimal unsatisfiable CNF formulas 
have no pure literals. A proof of the following lemma can be found in Aharoni 
and Linial [1], attributed there to Tarsi. 

Lemma 1 A minimal unsatisfiable CNF formula has more clauses than vari- 
ables. 

The incidence graph I{F) of a CNF formula F is a bipartite graph; variables 
and clauses form the vertices of /(F), a clause C and variable x are joined by 
an edge if and only x & C or ^x & C (see Fig. 1 for an example). A planar 
CNF formula is a CNF formula with a planar incidence graph. 
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Fig. 1. The incidence graph I{G) of the planar (3,4)-CNF formula F = 
{Ci,...,C5} with Cl = {a;}, C2 = {^x,y,z}, C3 = {^x,y,^z}, C4 = {^y,z}, 

C5 = 

2.2 Resolution 

Let C\,C2 be clauses with x € Ci, ~^x G C2, and var{Ci) fl var{C2) = {a;}. The 
clause C = {C\ U C2) \ {a:, ->a;} is called the resolvent of Ci and C2. We also say 
that C is obtained by resolving on x, and we call C\,C2 parent clauses of C. 

Recall that a vertex of a directed graph is called a sink if it has no successors, 
and it is called a source if it has no predecessors. A resolution refutation i? is a 
directed acyclic graph whose vertices are labeled with clauses, such that 

1. every non-source of R has exactly two predecessors and is labeled with the 
resolvent of the clauses labeling its predecessors; 

2. R contains exactly one sink; the sink is labeled with the empty clause. 

We call a non-source vertex of i? a step. A clause labeling a source of R is called 
an axiom of R. R is a resolution refutation of a CNF formula F if all axioms of 
R are contained in F. It is well known that a CNF formula is unsatisfiable if and 
only if it has a resolution refutation (resolution is “refutationally complete”). 

In the sequel we will measure the size of resolution refutations in terms of 
the number of steps^. 

We refer to any decidable property of a resolution refutation as a resolution 
refinement. In particular, we will consider the following refinements: 

— Tree-like resolution: The directed acyclic graph is a tree. 

— Regular resolution: On any path from a source vertex to the sink, any variable 
is resolved at most once. 

— P-resolution: at each resolution step, at least one of the parent clauses is a 
positive clause (i.e., a clause without negated variables); 

— Input resolution: every vertex is either a source or has a predecessor which 
is a source. 

— Literal-once resolution: distinct resolution steps resolve on distinct variables. 

— Read-once resolution: distinct sources are labeled by distinct clauses. 

® Another possible measure is the length of a refutation, dehned as the total number of 
vertices (i.e., steps -|- source vertices). It is easy to verify that a resolution refutation 
with k steps has at most k + 1 sources, and so its length is at most 2 k + 1. Therefore, 
our results carry over if we bound the length instead of the number of steps. 
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Note that the first three refinements are refutationally complete, but the last 
three refinements are not. Note also that every literal-once resolution refutation 
is tree-like, read-once, and regular. Every input resolution refutation is tree-like. 



2.3 Locally Bounded Treewidth 

Treewidth, a popular parameter for graphs, was introduced by Robertson and 
Seymour in their series of papers on graph minors; see, e.g., Bodlaender’s survey 
article [5] for definitions and references. 

Let u be a vertex of a simple graph G and let r be some positive integer. 
Nq{v) denotes the r -neighborhood of v, i.e., the set of vertices of G which can be 
reached from w by a path of length at most r. A class of graphs is said to have 
locally hounded treewidth if there exists a function / such that for all r > 1 and 
all vertices w of a graph G of that class, the treewidth of the subgraph included 
by Nq{v) is at most f{k). (Intuitively, the treewidth of the subgraph induced 
by an r-neighborhood of a vertex is a function of r and so less than the total 
number of vertices of G.) We give some examples of classes of graphs with locally 
bounded treewidth (see Frick and Grohe [10] for references). 

— By trivial reasons, the class of graphs of treewidth < t has locally bounded 
treewidth (/(r) = t). 

— The class of planar graphs has locally bounded treewidth (/(r) = 3r); more 
generally, the class of graphs with genus < g has locally bounded treewidth 
(/(r) = 0{gr)). 

— The class of graphs with maximum degree < d has locally bounded treewidth 
(/(r) = d(d- 1)^-1). 

3 Statement of Main Results 

Consider the following two parameterized problems. 

SHORT RESOLUTION REFUTATION 
Input: A CNF formula F. 

Parameter: A positive integer k. 

Question: Can F be refuted by at most k resolution steps? (i.e., can 
the empty clause be inferred from F hy k applications of the resolution 
rule?). 

SMALL UNSATISFIABLE SUBSET 
Input: A CNF formula F. 

Parameter: A positive integer k. 

Question: Does F contain an unsatisfiable subset F' with at most k 
clauses? 



Our main results are as follows. 
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Theorem 1 SHORT resolution refutation is yV[V\-complete. 

The problem remains W[l\-complete for the following resolution refinements: 
tree-like resolution, regular resolution, P-resolution, input resolution, read-once 
resolution, and literal-once resolution. 

Theorem 2 small unsatisfiable subset is W[l]-complete. 

Both theorems remain valid if inputs are confined to 3-CNF formulas. 

We show fixed-parameter tractability for classes of CNF formulas whose in- 
cidence graphs have locally bounded treewidth: 

Theorem 3 For CNF formulas of locally bounded treewidth, the problems 
short resolution refutation and small unsatisfiable subset are fixed- 
parameter tractable. 

Tractable cases include: planar CNF formulas, CNF formulas of bounded genus, 
and {k, s)-CNF formulas (fc-CNF formulas with at most s occurrences per vari- 
able). 

4 Proof of W[l]-hardness 

We are going to reduce the following well-known W[l]-complete problem. 
CLIQUE 

Input: A graph G. 

Parameter: A positive integer k. 

Question: Is there a set V CV (G) of k vertices that induces a complete 
subgraph of G (i.e., a clique of size k)l 

Given a simple graph G = (V,E), \V\ = n, and a positive integer k. We take 
distinct variables: for 1 < z < k, yij for 1 < z < j < A:, and for v G V and 

1 < i < k. We construct a CNF formula 

Pg — { Ggtart } C Fledges C Ayertices C Adean-up 

where 

Ggtart — • T k) ^yij . 1 ^ Z ^ j C: A:}, 

Fedges — { { f C: i ^ j C: k, UV G E'^, 

Fvertices — { ^v,i} : f C: i C: k, V G V , 

Fclean-up — { : f C: i C: k, V G V \ . 



k' = 




+ 2k. 



We put 
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Lemma 2 The following statements are equivalent. 

1. Fg has an unsatisfiable subset F' with at most k' + 1 clauses; 

2. G contains a clique on k vertices; 

3. Fg has a resolution refutation with at most k' steps which complies with the 
resolution refinements mentioned in Theorem 1; 

4- Fg has a resolution refutation with at most k' steps. 

Proof. 1^2. We assume that Fg is unsatisfiable and choose a minimal unsatis- 
fiable subset F' C Fg. First we show that 

Csjtart € F' . (1) 

Assume the contrary. Since F' has no pure literals, and since the variables Xi and 
Pij occur positively only in Cstart, we conclude that ^Vertices H F' = FedgesOF' = 
0. Hence, in turn, Fdean-up n F' = 0, thus F' = 0. However, the empty formula 
is satisfiable, a contradiction. Thus Cstart is indeed in F'. Since every clause in 
Fedges U Fvertices Contains the complement of exactly one variable of Cstart, it 
follows that 

|Fedges n F I > ) (2) 

iFverticesHF'l > fc. (3) 

It also follows that for every i G {!,..., fc} there is some v G V such that 
Zy^i G war( Fvertices H F'). The latter implies 

iFelean-upHF'l > A:. (4) 

Since |F'| < A:' -b 1 by assumption, (1) and the estimations (2)-(4) yield |F'| = 
A:'-|-l. Hence the estimations (2)-(4) must be tight. Consequently, strengthening 
the above observation, we conclude that for every i G {!,..., k}, there is exactly 
one vertex v G V such that G uar( Fvertices C F'). Let : {1, . . . , fc} ^ H be 
the map defined by 

ip{i) = V if and only if G uar( Fvertices C F'). 

In view of the tightness of the above estimations, we conclude that 

vari^F ) — Cgtart U { . 1 ^ A ^ Ai }. (5) 

Consequently, 

Fedges n F' = { : I < i < j < k, ip{i)(fi{j) G E}. 

We conclude that the vertices (p{l) , . . . , (p{k) are mutually distinct; thus 
(/?(!), . . . , <p{k) induce a clique of size k in G. 

2=b3. Assume that G contains a clique on k vertices. Consequently, there is an 
injective map ip : {1, . . . , k} ^ V such that ip{i)(p{j) G F for all 1 < i < j < k. 
We devise an input resolution refutation R of Fg, proceeding in three phases: 
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1. For 1 < z < j < fc we resolve Cstart with the clauses {^yij, ^ip{j),j} G 

Fedges- We end up with the clause C" = { Xi, ^ip(i),i : * = 1, • ■ • , fc }• 

2. For 1 < z < fc we resolve C with the clauses {^Xi, Zcp(i)^i} G Fvertices- We 

end up with the clause C" = { : z = 1 , . • . , fc }. 

3. For 1 < z < fc we resolve C" with the clauses G Fdean-up- We end 

up with the empty clause. 

By construction, i? complies with the resolution refinements as claimed. More- 
over, K contains Q) + k + k = k' resolution steps. 

3=^4. Trivial. 

4=^1. Assume that Fq has a resolution refutation F with at most k' steps. 
Let F' denote the set of axioms of F. Note that F' is necessarily unsatisfiable, 
and since F has at most fc' -I- 1 sources, \F'\ < k' + I follows. □ 

The construction of Fq from G can be carried out in time f{k){\V\ + 
for some function /. Thus Lemma 2 yields an fpt-reduction from clique to 
SHORT RESOLUTION REFUTATION with respect to the resolution refinements 
mentioned in Theorem 1, and an fpt-reduction from clique to small unsat- 
isfiable SUBSET. Since clique is well-know to be W[l]-complete [ 8 ], we have 
established the hardness parts of Theorems 1 and 2. 

4.1 3- CNF Formulas 

Using a slight modification of the above construction, we can show that the 
above hardness results hold for 3-CNF formulas. By means of a variant of Tseitin 
Extension [19], we transform a CNF formula F with clauses of size > 2 into a 
3-SAT formula t{F), applying the following operations. 

— Replace a clause {zci, . . . , zz;„} of size zz > 3 by the clauses {wi,W 2 ,ui}, 
{^Un- 3 ,Wn-i,Wn}, and {^zz* , Wi+ 2 , zZj+ 1 } for z = 1, . . . ,rz - 4 where Ui are 
new variables. 

— Replace a clause {zci,zc 2 } by the clauses {wi,W 2 ,u}, {^zz, zci, ZC 2 }, zz is a 
new variable. 

— Replace a clause {zc} by the four clauses {zz>, zzi, ZZ 2 }, {zc, zzi, ^zz 2 }, 
{zc, ^zzi, ZZ 3 }, {zc, ^zzi, ^zzs}, Ui are new variables. 

It is straightforward that F is satisfiable if and only if t{F) is satisfiable. More- 
over, if F is minimal unsatisfiable, then so is t{F), and the difference between 
the number of clauses and the number of variables remains the same for F and 

m. 

In view of the first part of the proof of Lemma 2 it follows that a minimal 
unsatisfiable subset F" of ^(Fg) contains all + k — 2 clauses of tdCstart}), 
( 2 ) clauses of t(Fedges), ‘2-k clauses of t(Fvertices), and 4fc clauses of t(Fdean-up)- 
In summary, the number of clauses in F" is exactly 

k” = 2^^ + 7k-2. 

The proof of Lemma 2 carries over to t{Fa) using k” instead of k' . 
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5 Membership in W[l] and FPT Resnlts 

Let S denote a finite relational structure and ip a first-order (FO) formula (we 
quietly assume that the vocabularies of ip and S are compatible). S' is a model 
of ip (in symbols S |= (p) if is true in S in the usual sense (see, e.g., [9, 14] 
for further model theoretic definitions). Model-checking, the problem of deciding 
whether S \= ip, can be parameterized in different ways; in the sequel we will 
refer to the following setting. 

FO MODEL CHECKING 

Input: A finite structure S, a FO formula ip. 

Parameter: The length of ip. 

Question: Is S a model of tp? 

Recall that a FO formula ip is positive if it does not contain negations or the 
universal quantifier V. We will use the following result of Papadimitriou and 
Yannakakis [16]. 

Theorem 4 FO model checking for positive formulas is W[l]~ complete. 

In [16] it is also shown that without the restriction to positive formulas, FO 
model checking is W[t]-hard for all t. 

We associate to a relational structure S its Gaifman graph G{S), whose 
vertices are the elements of the universe of S, and where two distinct vertices 
are joined by an edge if and only if they occur in the same tuple of some relation 
of S. By means of Gaifman graphs, one can speak of the treewidth of a relational 
structure and of classes of structures with locally bounded treewidth. 

We shall use the following strong result of Frick and Grohe [10]. 

Theorem 5 FO model checking for structures with locally hounded treewidth 
is fixed-parameter tractable. 

In the subsequent discussions, p denotes any of the resolution refinements 
mentioned in Theorem 1. 

Let yi,y 2 ,... be an infinite supply of variables. For k > I we define the 
following classes of GNF formulas. 

— denotes the set of GNF formulas F with var{F) = {yi, . . . , j/fc/} for some 
fc' < k. 

— denotes the set of minimal unsatisfiable formulas in with at most 
fc -I- 1 clauses. 

— TZ^ denotes the set of GNF formulas F G such that F is the set of axioms 
of some resolution refutation with at most k steps; TZ^ is TZ^ restricted to p- 
resolution. 



Lemma 3 Every formula F G TZ^ has at most k 1 clauses. 
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Proof. We proceed by induction on fc. If fc < 1 then the lemma holds trivially, 
since either F = { 0 } or = {{j/i}, {^ 2 /i}}- Assume that k > 2 and F G 
. Consequently, there is a resolution refutation R with exactly k steps 
such that F is the set of axioms of R. We observe that R must contain a step 
Vo where both predecessors vi,V2 of vq are sources. Let Ci denote the clause 
which labels Vi, 0 < i < 2 . We remove vi and V2 from R and obtain a resolution 
refutation R' with k — 1 steps. The vertex vq is now a source of R' . Let a and 
a' denote the number of axioms of R and i?', respectively. Observe that a' is 
minimal if ( 1 ) Cq is an axiom of R and ( 2 ) Ci,C2 are not axioms of R'. Thus 
a' > a — 2 + 1 . Since the set of axioms of R' belongs to , we have a' < fc by 
induction hypothesis, hence |F| = a < A: + 1 follows. □ 

Since there are less than 3 ^ clauses over the variables {y\, . . . ,yk\ (a vari- 
able appears positively, appears negatively, or does not appear in a clause), we 
conclude the following. 

Lemma 4 The sets and TZp are finite and computable. 

We represent a CNF formula F by a relational structure Sp = (P, N,V) as 
follows. For every variable a; of F and every clause C of F, the universe of Sp 
contains distinct elements Ox and ac, respectively. The relations of Sp are 

F = { {ax^ac) '■ X G var{F), C G F, x G C} (positive occurrence), 

N = { (ax,ac) ■ X G var{F), C G F, ~^x G C} (negative occurrence), 

V = { ax '. X G var{F) } (being a variable). 

For example, the formula of Fig. 1 is represented by the structure Sp = (P,N,V) 
with P = {{ax,aci), {ay, 002), (0^,003), (0^,002), (2,004)}, ^ = {{ax,ac2), 
{ax,ac3), {ay,acj, {ay,ac^), (0^,003), {a^,ac^)} and V = {ax, ay, a^,}. 

In order to express that two variables are distinct without using negation, 
we also consider the structure Sp = (F, N, V, D) with the additional relation 

D = { {ax, ax') : x,x' G var{F), x x' } (distinctness). 

The next lemma is a direct consequence of the definitions (cf. Fig. 1 ). 

Lemma 5 The incidence graph I{F) and the Gaifman graph G{Sp) are iso- 
morphic for every GNF formula F. 

Let fc > 1 and take two sequences of distinct FO variables v = v\,...,Vk 
and w = wi, . . . ,Wk+i. For a CNF formula F G with F = {Ci, . . . ,Ck"}, 
k" < k 1 , and |uor(F)| = fc' < fc we define the quantifier- free formula 

k'' / 

p[F]= A = A P{'^^^Wj)A A N{v„Wj) 

l<i<j<k' j=l XviGCj 

Furthermore, for G {jGi^,TZp} we define 

k 

f\v{v.)A V m 

i=l 





= 3 v 3 w 
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Similarly we define positive formulas ^p^[F] using “D{vi,Vjy^ instead of = 
Uj” and using instead of ip[F]. 

Lemma 6 For every CNF formula F the following holds true. 

1. F has a p-resolution refutation with at most k steps if and only if Sp ^ ‘P[F-’y\ 
(i.e., Sy h 

2. F contains an unsatisfiable subset of size at most k+1 if and only if Sp \= 
p[M^] (i.e., Sy \= Lp^[M^]). 

Proof. Let i? be a p-resolution refutation of F with at most k steps, and let 
F' denote the set of axioms of R. Since all variables occurring in axioms of R 
are resolved in some of the resolution steps, \var{F')\ < k follows. We put k' = 
|t;ar(F')| and pick arbitrarily a bijection r : var{F') {yi, . . . ,yk'}. Renaming 
the variables in F' according to r yields a formula r{F') which belongs to TV(, C 
TZ’f (observe that |J^'| = |r(J^')| < k + 1 by Lemma 3). It follows now from the 
definition of +\R^p] that Sp |= <p[7^p] (equivalently, that Sp |= (p+ [7?.^]). 

Now assume that F contains an unsatisfiable subset F' with at most fc -I- 1 
clauses; we may assume that F' is minimal unsatisfiable. By Lemma 1 it follows 
that |uar(i^')| < k. Consequently, as in the previous case, we obtain from F' by 
renaming a formula r(F') G establishing Sp ^ Lp[M^] and Sp |= 

The converse directions follow directly from the respective definitions of TV(, 
and . □ 

To complete the proofs of Theorems 1, 2, and 3, it only remains to join 
together the above results: In view of Theorem 4, Lemma 6 implies directly 
the W[l]-membership part of Theorems 1 and 2. Whence Theorems 1 and 2 
are shown true. Furthermore, Theorem 3 follows directly from Theorem 5 by 
Lemmas 5 and 6. 

6 Concluding Remarks 

Numerous parameterized problems have been identified as being W[l]-complete, 
for example, the Halting Problem for nondeterministic Turing machines, parame- 
terized by the number of computation steps. Our Theorem 1 links parameterized 
complexity with the length of resolution refutations, another fundamental con- 
cept of Logic and Computer Science; thus our result provides additional evidence 
for the significance of the class W[l]. 

Our positive results, the fp-tractability of SHORT resolution refuta- 
tion and SMALL UNSATISFIABLE SUBSET for classes of CNF formulas of locally 
bounded tree-width, are obtained by application of Frick and Grohe’s metathe- 
orem which does not provide practicable algorithms. However, the results show 
that fp-tractability can be achieved in principle, and so that further efforts for 
finding more practicable algorithms based on the particular combinatorics of the 
problems are encouraged. We think that the classes of planar CNF formulas and 
{k, s)-CNF formulas are good candidates for such an approach. 
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Abstract. We present an algorithm for the parameterized feedback ver- 
tex set problem that runs in time 0((21gfc -|- 21glgfc + 18)^n^). This 
improves the previous 0(max{12^ , (41g algorithm by Raman et 

al. by roughly a 2*^ factor (n™ G is the time needed to mul- 

tiply two n X n matrices). Our results are obtained by developing new 
combinatorial tools and employing results from extremal graph theory. 
We also show that for several special classes of graphs the feedback ver- 
tex set problem can be solved in time for some constant c. This 

includes, for example, graphs of genus O(lgn). 



1 Introduction 

Given an undirected graph G, a feedback vertex set in G is a subset of vertices F 
in G such that G — F is acyclic. The size of a feedback vertex set F is |J^|. The 
FEEDBACK VERTEX SET problem (FVS) is: given a graph G and a positive integer 
k, decide if G has a feedback vertex set of size at most fc. It is well-known that 
the FVS problem is NP-complete on both directed and undirected graphs [17]. 
The minimization version of this problem has been studied intensively from the 
approximability point of view [1, 2, 13, 16, 24] due to its important applications 
in fields like circuit testing, deadlock resolution, and analyzing manufacturing 
processes [13, 18, 21]. For example, in the field of circuit testing, a small set of 
registers (vertices) needs to be identified in the circuit (graph) whose removal 
makes the circuit testable (i.e., the circuit needs to be acyclic) [18]. 

The FVS problem has also received considerable attention from the param- 
eterized complexity point of view [3, 4, 10, 11, 23]. A parameterized problem is 
said to be fixed-parameter tractable if the problem can be solved in time f{k)n^^^^ 
for some function / which is independent of n [11]. The class FPT denotes the 
class of all fixed-parameter tractable problems [11]. It was shown that the FVS 
problem on undirected graphs is in FPT [4, 10, 11], whereas it remains an open 
question whether the FVS problem on directed graphs is in FPT [11]. 

* The first author was supported in part by DePaul University Competitive Research 
Grant. 
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Once a problem has been shown to be in FPT, the search for better algo- 
rithms for the problem continues; that is, algorithms that remain practical for 
larger values of the parameter k. Successful examples of such developments in- 
clude the VERTEX COVER and planar dominating set problems (see [7, 15] 
and their references). The same holds true for the FVS problem on undirected 
graphs. Bodlaender [4], and Downey and Fellows [10], were the first to show that 
the problem is in FPT. In [11], Downey and Fellows presented an 0{{2k+ 
time algorithm for the problem. Becker et al. [3] gave a randomized algorithm 
running in time 0(4^fcn) that finds a minimum feedback vertex set of size k with 
probability at least 1 — (1 — 4“^)'^^ for an arbitrary constant c. By observing 
that every undirected graph on n vertices with minimum degree at least 3 has a 
cycle of length bounded by 2 Ig n -|- 1, Raman presented a very simple algorithm 
for the problem running in time 0((6fclg [22], where G is 

the running time of the best algorithm for multiplying two n x n matrices [8]. 
More recently, using some nice combinatorial techniques, Raman et al. [23] pre- 
sented an algorithm for the problem running in time 0(max{12’^ , (41gfc)^}n“) 
improving significantly the 0{{2k+ time algorithm given in [11] (when k 

is sufficiently large). 

In this paper we continue the efforts towards reducing the running time of 
the algorithms for the FVS problem. We develop new combinatorial tools and 
employ known results from extremal graph theory to show that the size of a 
minimum feedback vertex set is f2{n/lgn) in a graph with no cycles of length 
bounded by Ig n. This allows us to obtain an 0((2 Ig k+2 Ig Ig fc -|- 18)^n^) time al- 
gorithm for the FVS problem, improving the previous 0{max{12^ , (41g fc)^}n“) 
time algorithm in [23] by roughly a 2^ factor. Obviously, the running time of 
this algorithm is still far from being practical, and the question of whether the 
FVS problem can be solved in time remains open. 

We also consider the FVS problem on special classes of graphs. We show 
that the problem on graphs of genus O(lgn) and on RT’’-minor free graphs is 
solvable in time for some constant c. We also show that the problem 

on bipartite graphs, graphs of genus 0{rf) for any e > 0, and constant average 
degree graphs, can be solved in time for some constant c if and only if 

the FVS problem on general graphs can. 

2 Preliminaries 

Let G = (V, E) be an undirected graph. For a set of vertices S' in G we denote by 
G — S the subgraph of G that results from removing the vertices in S, together 
with all edges incident to them. A minimum feedback vertex set is a feedback 
vertex set of minimum size. We denote by 7(G) the size of a minimum feedback 
vertex set in G. For a vertex v, we denote by deg(v) the degree of v in G. 

Let T be a vertex in G such that deg{v) < 2. We define the following opera- 
tion, which is standard in the literature. If deg(v) = 1 then remove v (together 
with its incident edge) from G; if deg{v) = 2 and the two neighbors x and y oi v 
are not adjacent, then remove v and add an edge between x and y. Let us denote 
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this operation by Short-Cut(t>). We say that the operation Short-Cut() is ap- 
plicable to a vertex v, if either deg{v) = 1, or deg{v) = 2 and the two neighbors 
of V are non-adjacent. A variation of the following proposition appears in [23] 
(see [2] for a proof). 

Proposition 1 . (Lemma 1, [23]) Let G he an undirected graph and let v he a 
vertex in G to which the operation Short-Cut() is applicable. Let G' he the 
graph resulting from applying Short-Cut(ti). Then j{G') = 7(G)- 

We assume that we have a subroutine Clean(G) which applies the opera- 
tion Short- Cut 0 repeatedly to G until the operation is no longer applicable. 
It is clear from Proposition 1 that if G' is the resulting graph from applying 
Clean(G), then 7(G') = 7(G). The graph G is said to be clean, if Clean(G) 
is not applicable. Note that any degree-2 vertex in a clean graph must be on a 
cycle of length three. 

An almost shortest cycle in G is a cycle whose length is at most the length 
of a shortest cycle in G plus one. It is well-known that an almost shortest cycle 
in an undirected graph with n vertices can be found in time O(n^) [19]. It is 
also well-known that any undirected graph with minimum degree at least 3 has 
a cycle of length at most 21gn -I- 1 [12]. 

3 The Algorithm 

The basic idea behind most of the parameterized algorithms for the FVS problem 
presented so far has been to branch on short cycles and use the search tree 
method [11, 22, 23]. Suppose we are trying to determine if there exists a feedback 
vertex set in G of size bounded by k. Let G be a cycle in G of length 1. Then 
every feedback vertex set of G must contain at least one vertex of G. For every 
vertex v on G, we can include v in the feedback vertex set, and then recurse 
to determine if G — z; has a feedback vertex set of size k — 1. Let us call such 
a process: branching on the cycle G. If we let T{k) be the number of nodes in 
the search tree of such an algorithm that looks for a feedback vertex set of size 
bounded by k, then when the algorithm branches on a cycle of length I, T(k) can 
be expressed using the recurrence relation T(k) < I - T(k — 1) -I- 1. The number of 
nodes in the search tree corresponding to the algorithm is 0{{lmax)’^), where Imax 
is the length of the longest cycle the algorithm branches on [11]. The running 
time of the algorithm is now proportional to the number of nodes in the search 
tree multiplied by the time we spend at every node of the search tree to find a 
cycle and process the graph. Thus, to reduce the running time of the algorithm, 
it is desirable to branch on short cycles. Most parameterized algorithms so far 
hinge on this approach [11, 22, 23]. 

In this section we develop another algorithm that uses this approach (based 
on the algorithm in [23]). We present the algorithm in Figure 1 below. We prove 
its correctness and analyze its running time in the next section. 
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FVS-solver 

Input: an instance (G, k) of FVS 

Output: a feedback vertex set F of G of size bounded by k in case it exists 

0. F = 0; 

1. if G is acyclic then return(F); 

2. if fc = 0 and G contains a cycle then return(’NO’); 

3. apply Clean(G); 

4. let G be an almost shortest cycle in G of length 1; 

5. if / > 13 and k < 3y^ then return(’NO’); 

else if I > Ign + 1 and (Ig n > Ig A: + Ig Ig fc + 13) then return) ’NO’); 
else if I > max{2 Ig fe + 18, 2 Ig n — 9} then return(’NO’); 
else branch on G and update k, F, and G accordingly; 

Fig. 1. The algorithm FVS-solver 



4 Analysis of FVS-solver 

The main idea behind the analysis of the algorithm presented in [23] is that 
if the graph does not contain a cycle of constant length, then the size of the 
feedback vertex set k must be large. In particular, the following structural result 
immediately follows from [23]. 

Lemma 1. (Theorem 2, [23]) Let G he a graph on n vertices with minimum 
degree at least 3. If there is no cycle in G of length at most 12 then 7(G) > 

The above result shows that when a clean graph has no cycle of length 
bounded by 12 (note that no degree-2 vertex exists in G at this point since G 
does not contain a cycle of length 3), k > i^/n, and hence, Ign < 21g/c — 3. 
Since every undirected graph with minimum degree at least 3 must have a cycle 
of length bounded by 2 Ign -I- 1, G has a cycle of length bounded by 41g/c. An 
algorithm that branches on a shortest cycle will then either branch on a cycle 
of length at most 12, or of length at most 41g/c. According to the discussion in 
the previous section, this gives a search tree of size 0((maa:{12, 41g fc})^). 

In this section we develop new combinatorial techniques and employ results 
from extremal graph theory to improve this analysis. The structural results ob- 
tained in this section will allow us to prove an upper bound of 0((2 Ig k+2 Iglg k+ 
18)^) on the size of the search tree of the algorithm FVS-solver presented in 
the previous section. 

Let T be a tree. For a vertex w in T we denote by driv) the degree of v in T. 
A vertex v G T is said to be good if driv) < 2. The statement of the following 
lemma can be easily proved.^ 

An easy way to see why the statement is true is to note that the average degree of 
a tree is bounded by 2. 



3 
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Lemma 2. There are at least [g/2 + IJ good vertices in a tree on q vertices. 

For a good vertex u in T, {u} is said to be a nice set if driu) < 1, and for 
two good vertices u and v in T, the set {«., f } is said to be a nice set if (u, v) is 
an edge in T. 

Lemma 3. Let T he a tree, and let Ug be the number of good vertices in T . Then 
there exists at least [{ug + 1)/2J nice sets in T that are pairwise disjoint. 

Proof. Without loss of generality, we assume that T is rooted at a vertex r. We 
define the natural parent-child, and ancestor-descendent relationships, between 
the vertices in T. Note that each vertex in T is either the root vertex or has 
exactly one parent in T. For every vertex v in T, let T„ denote the subtree of T 
rooted at v and containing all its descendents. We will prove the following state- 
ment: if T„ contains q good vertices of T, then the number of pairwise disjoint 
nice sets in T„ is at least [(<?-l- 1)/2J . It is clear that the previous statement will 
imply the statement of the lemma because T = T^. (Observe that a good vertex 
in Ty might not be a good vertex in T, and this is why we require the q vertices 
to be good in T.) To prove the statement, let Ty be a rooted subtree of T, and 
proceed by induction on q. Ty must contain at least one leaf of T, so q > 0. If 
q < 2, then Ty must contain a leaf u in T, and {m} is a nice set. Therefore the 
number of (pairwise disjoint) nice sets in this case is at least 1 > [(g -I- 1)/2J. 
Suppose now that the number of pairwise disjoint nice sets in any rooted tree 
containing p good vertices from T, for 2 < p < q, is at least [{p + 1)/2J. We 
distinguish two cases. 

Case 1. v has at least two children. Let v\,. . . ,Vd, be the children of v in Ty. 
Let qi be the number of good vertices in T that are in Ty. , i = 1 ... d, and note 
that 1 < qi < q, and that q < qi + . . .qd + ^ (v might be good, in which case it is 
the root of the tree and d = 2). By the inductive hypothesis, each T„. contains 
at least [{qt + 1)/2J pairwise disjoint nice sets in T. Since every nice set in Ty^ is 
disjoint from every nice set in Ty. for 1 < i yf j < d, it follows that Ty contains 
at least [(gi -I- . . . -I- (7d + d)/2j = [(gi -|- . . . -|- 1 -|- d — 1)/2J > [(g -|- 1)/2J 

pairwise disjoint nice sets in T (note that d > 2). 

Case 2. v has exactly one child v'. In this case v must be a good vertex in T. If 
v' is good, let v” be the child of v' in Ty (note that v” must exist since g > 2). 
Now Ty>i contains g — 2 good vertices in T. By induction, the number of pairwise 
disjoint nice sets in T„// is at least [(g — 1)/2J. Since {v,v'} is a nice set which 
is disjoint from all the nice sets in T„//, it follows that the number of pairwise 
disjoint nice sets in Ty is at least [(g -I- 1)/2J. If v' is bad, then v' must have 
at least two children in Ty>. The proof now is identical to that of Case 1 by 
applying induction on the trees rooted at the children of vf 

This completes the induction and the proof. □ 

This lemma follows directly from Lemma 2 and Lemma 3. 




240 lyad Kanj, Michael Pelsmajer, and Marcus Schaefer 



Lemma 4. Let T he a tree on q vertices. There are at least q/4 pairwise disjoint 
nice sets in T. 

Let (G, k) be an instance of FVS, where G is clean and has n vertices, and 
assume that G does not have a cycle of length bounded by 12. Since G is clean 
and has no cycles of length 3, G has minimum degree at least 3. Let F be a 
minimum feedback set of G, let / = |F|, and T = G — F. Applying Lemma 4 to 
every tree in IF, we get that T (i.e., the trees in T) contains at least (n — /)/4 
pairwise disjoint nice sets. So if we let S be the set of pairwise disjoint nice sets in 
IF, then IS”! > (n — /)/4. We construct a graph Gp as follows. The set of vertices 
of Gp is F. The edges of Gp are defined as follows. Let {a, b} be a nice set in S. 
Since a and b are good in T (i.e., in the tree of F that they belong to) and both 
have degree greater or equal to 3 in G, o must have at least one neighbor in F, 
and b must have at least one neighbor F. Pick exactly one neighbor oi of a in 
F, and exactly one neighbor bi of b in F. Note that ai and bi must be distinct 
since G has no cycles of length 3. Add the edge (ai,6i) to Gp. Now let {a} be 
a nice set in S, then a must have at least two neighbors in F. We pick exactly 
two neighbors ai and 02 of a in F, and we add the edge (01,02) in Gp. This 
completes the construction of Gf- Since G has no cycles of length bounded by 
6, for any two distinct nice sets in S, the edges associated with them in Gp are 
distinct. This means that Gf is a simple graph with at least (n — /)/4 edges. 
We have the following lemma. 

Lemma 5. If Gp has a cycle of length I then G has a cycle of length hounded 
by 31. 

Proof. Let (vi, . . . , vi,v\) be a cycle in Gp. Since each {vi, Ui+i) (the index arith- 
metic is taken modulo 1) is an edge in Gf, {vi, r'i+i) is associated with a nice set 
Si in S. If Si = {o}, let Pi be the path (r;i,a,fi+i) in G; if Si = {o, 6}, let Pi 
be the path (uj, o, 6, Ui+i) in G. Since the nice sets in S are disjoint, the paths 
Pi, i = 1,. . . ,1, are internally vertex-disjoint in G, and they determine a cycle 
of length bounded by 3L □ 

The following result is known in extremal graph theory (see [5, 6, 14]). 

Lemma 6. Let G be a graph with n vertices where n>3.IfG does not contain a 
cycle of length at most 21, then the number of edges in G is bounded by 



Theorem 1. Let G he a graph with n >3 vertices and with no cycles of length 
hounded by max {12, Ign}. Then ^{G) > (n/(61 lgn))^“®/^'®”+®^. 

Proof. Suppose that G does not have a cycle of length bounded by max{ 12, Ig n} . 
Let F be a minimum feedback vertex set of G, and let / = |F| = 7(G). Since 
G has no cycles of length bounded by 12, if we let F and Gp be as defined 
in the above discussion, then it follows from above that the number of edges 
in Gp is at least (n — /)/4. Since G does not have a cycle of length bounded 
by Ign, by Lemma 5, Gp has no cycle of length bounded by (lgn)/3. Applying 
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Lemma 6 with I = (lgn)/6, we get that the number of edges in is bounded 
by 15 (lgn)/i+®/^ 8 "^ Thus 

(n-/)/4<15(lgn)/i+®/'s" 

n < 60(lgn)/i+®/‘s" + / < 60(lgn)/^+®/‘s" + /i+e/'s" 
n < 61(lgn)/i+®/'s" (1) 

Manipulating (1) we get / > (n/(61 completing the proof. 

□ 

Theorem 1 implies that the size of a minimum feedback vertex set in a graph 
with minimum degree at least 3 and no cycles of length bounded by Ig n must 
be I7(n/lgn). 

Corollary 1. Let G he a graph with n > 3 vertices and with no cycles of length 
bounded by maa:{ 12, Ign}. Then Ign < lg 7 (G) +lglg 7 (G) + 13. 

Proof. Applying the Ig () function on both sides of (1) in Theorem 1 we get 



lgn<lg61 + lglgn + lg/ + 61g//lgn (2) 

Since G has no cycles of length bounded by 12, from Lemma 1 we get Ign < 
2 Ig /, and hence Ig Ig n < 1 + Ig Ig /. Now noting that Ig 61 < 6 and Ig / < Ig n, 
it follows from (2) that Ig n < Ig / + Ig Ig / + 13. □ 



Lemma 7. Let G he a graph with n vertices and minimum degree at least 
3. For any integer constant d > 0, there is a cycle in G of length at most 
max{2lgj(G) + 21g (d - 1) + 3, 21gn - 21g (d + 1) + 4}. 

Proof. Let d > 0 be given, and let Z\ be the maximum degree of G. By Lemma 4 
in [23], 7 (G) > {S — 2)n/{2{A— 1)), where S is the minimum degree of the graph. 
Since <5 > 3, we get n < 2(Z\ — 1 ) 7 (G), and hence 



Ig n < Ig 7 (G) + Ig (Z\ - 1) + 1 
21 gn+K 21 g 7 (G) + 21g(Z\-l) + 3 (3) 

Let r be a vertex in G of degree A. Perform a breadth-first search starting at 
r until a shortest cycle is first encountered in the graph. Let I be the length of the 
shortest cycle first encountered in this process. Since r has degree A and every 
other vertex has degree at least 3, it is not difficult to show, using a counting 
argument, that the number of vertices in the graph is at least — 1) -I- 1 

if I is even, and A{2^''~^'>/^ — 1 ) -I- 1 if Hs odd. 
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Suppose that ? > 4. We have 

n > Z\( 2 ('- 2)/2 _ 1 ) + 1 
n > Z\( 2 ('- 2)/2 _ 

Ign > lgZ\ + lg(2('-2)/2 - 1) (4) 

Also I > 4 implies that 2^*“^^/^ > 2. Using the inequality Ig (a; — 1) > Igx — 1 
for a; > 2 in (4), and manipulating (4), we get 



1 < 21gn-21gZ\ + 4 (5) 

If Z < 4, then I = 3 and since n> A, (5) is still true. 

Now if Z\ < d, then from the fact that G has a cycle of length bounded 
by 21gn + 1, and from (3), we conclude that there is a cycle in G of length 
at most 21g7(G) + 21g(Z\ — 1) + 3 < 21g7(G) + 21g(d — 1) + 3. Otherwise, 
A > d + 1, and by (5), there is a cycle in G of length at most 2 Ig n — 2 Ig Z\ + 
4 < 21gn — 21g (d+ 1) + 4. It follows that G has a cycle of length at most 
maa:{21g7(G) + 21g(d — 1) + 3,21gn — 21g(d+ 1) + 4}. This completes the 
proof. □ 

Corollary 2. Let G be a graph with minimum degree at least 3. There exists a 
eyele in G of length at most max{2 Ig 7 (G) + 17, 2 Ig n — 10} 

Proof. Apply Lemma 7 with d = 127. □ 



Theorem 2. Let G be a graph with n vertices. Ln time 0((21gfc + 
2 Iglg fc + 18)^n^) we can decide if G has a feedback vertex set of size bounded 
by k. 

Proof. It is not difficult to see that the algorithm FVS-solver solves the FVS 
problem. In particular, if FVS-solver returns a NO answer in step 5, then either 
(i) I > 13 and k < 3-y/n, or {ii) I > Ig n + 1 and Ig n > Ig Zc + Ig Ig Zc + 13, or (Hi) 
I > max{2 Ig Zc + 18, 2 Ig n — 9}. Since I is the length of an almost shortest cycle, 
if (j) holds then G does not have a cycle of length bounded by 12, and hence no 
feedback vertex set of size bounded by k by Lemma 1. If (ii) holds, then G does 
not have a cycle of length bounded by max{ 12 , Ign}, and hence no feedback 
vertex set of size bounded by k by Corollary 1 (note that in this case n > 3, and 
I must also be greater than 13 since Ign > 13). Finally if (Hi) holds, then G has 
no cycle of length bounded by {21gZc + 17, 2 Ign — 10}. From Corollary 2 (note 
that G has minimum degree at least three at this point since G is clean and has 
no cycles of length three) we conclude that k must be smaller than 7 (G), and 
hence G has no feedback vertex set of size bounded by k. 

To analyze the running time of the algorithm, let I be the length of a cycle 
G that the algorithm branches on. By looking at step 5 in the algorithm, we see 
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that if the algorithm branches on C then one of the following cases must hold: 
(a) I < 13, or (b) ^ > 13, fc > 3-y/n, and I < lgn+1, or (c) Ign < Ig fc + lg Ig A: + 13 
and I < max{2lgk + 18, 21gn — 9}. 

If (b) holds, then the conditions in (b) give that I < 2lgk. If (c) holds, 
then combining the two inequalities in (c) we get I < max{2lgk + 18,21gA: + 
21glgfc + 17} < 21g/c + 21glgA: + 18. It follows that in all cases (a), (b), and 
(c), the algorithm branches on a cycle of length at most 21gfc + 21glgA: + 18. 
Thus, according to the discussion at the beginning of this section, the size of 
the search tree corresponding to the algorithm is 0{{2lgk + 21glgfc + 18)^). 
Now at each node of the search tree the algorithm might need to find an almost 
shortest cycle, call Clean(), check if the graph is acyclic, and process the graph. 
Finding an almost shortest cycle takes O(n^) time. When Clean() is applied, 
since every vertex that Clean() removes has degree bounded by 2, Clean() can 
be implemented to run in linear time in the number of vertices it removes, and 
hence, in 0(n) time. Checking if the graph is acyclic and processing the graph 
takes no more than 0{n^) time. It follows that the running time of the algorithm 
is 0((21g fc + 21glg A: + 18)''n^). □ 

According to the above theorem, the algorithm FVS-solver improves the 
algorithm in [23] by roughly a 2^ factor. 

5 FVS on Special Classes of Graphs 

In this section we consider the FVS problem on special classes of graphs. We look 
at the following classes: bipartite graphs, bounded genus graphs, and A''’-minor 
free graphs for fixed r. 



Bipartite Graphs 

Let G be a graph with n vertices and m edges. Consider the operation of subdi- 
viding an edge e in G by introducing a degree-2 vertex v. Then this operation 
is precisely the inverse operation of Short- Cut (u). Therefore, if we let G' be 
the graph obtained from G by subdividing an edge e G G, then it follows from 
Proposition 1 that 7 (G') = 7 (G). Subdividing each edge in G yields a bipartite 
graph G', which according to the previous statement satisfies 7 (G') = 7 (G). The 
graph G' has n + m vertices and 2m edges. This shows that the FVS problem on 
general graphs can be solved in time f{k)n^^^^ if and only if the FVS problem 
on bipartite graphs can be solved in f{k)'nP^^'^ time. In particular, an algorithm 
of running time (for some constant c) for the FVS problem on bipartite 

graph implies an algorithm of running time for the FVS problem on 

general graphs. 



Bounded Genus Graphs 

The following lemma follows from a standard Euler- formula argument. 
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Lemma 8. Let G be a graph with n vertices, minimum degree at least 3, and 
genus g. If n > 8g then there is a cycle in G of length at most 12. 

Let G be a graph with no vertices and genus go satisfying go < c Ig no for some 
constant c. Note that if we branch on a cycle in G or process G as in the algorithm 
FVS-solver, the number of vertices and genus of G change, and the relationship 
g < clgn, where n and g are the number of vertices and genus, respectively, in the 
resulting graph may not hold. However, since the operations in FVS-solver do 
not increase the genus of the graph, the genus g of the resulting graph satisfies 
9 If 9o ^ clgno- Consider the algorithm BGFVS-solver given in Figure 2, 
which is a modification of the algorithm FVS-solver presented in Section 1, 
that will solve the FVS problem on graphs of genus bounded by clgn. 



BGFVS-solver 

Input: an instance (G, k) of FVS where G has no vertices and genus go 
Output: a feedback vertex set F of G of size bounded by k in case it exists 
and go satisfies go < clgno 

0. F = 0; 

1. if G is acyclic then return(F); 

2. if fc = 0 and G contains a cycle then return (’NO’); 

3. apply Clean(G); 

4. if the number of vertices n of G is bounded by 8c Ig no then 

solve the problem by brute force and STOP; 

5. let G be an almost shortest cycle in G of length 1; 

6. if / > 13 then return(’Invalid Instance’); 

else branch on G and update k, F, and G accordingly; 

Fig. 2. The algorithm BGFVS-solver 



It is not difficult to see the correctness of the algorithm BGFVS-solver. 
The only step that needs clarification is when the algorithm returns in step 6 
that the instance is invalid. If this happens, then the number of vertices n in 
the resulting graph G must satisfy n > Sclgno, where no is the number of 
vertices in the original graph, and G does not have a cycle of length bounded 
by 12 (note that the algorithm finds an almost shortest cycle). Since the genus 
of the resulting graph g is bounded by the genus go of the original graph, if go 
satisfies the condition go < clgno then we would have 8g < 8go < Sclgno < n, 
which according to Lemma 8, would imply the existence of a cycle of length at 
most 12 in G (since G has minimum degree at least 3 at this point). Since no 
such cycle exists in G, the genus go in the original graph does not satisfy the 
condition go < clgno, and hence the input instance does not satisfy the genus 
bound requirement. Therefore the algorithm rejects the instance in this case. 
This shows actually that BGFVS-solver does not need to know in advance if 
the input instance satisfies the genus bound requirement or not. As long as the 
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input instance satisfies the genus bound requirement, the algorithm solves the 
problem. 

Now to analyze the running time of the algorithm, we note that step 4 can 
be carried out in time by enumerating every subset of vertices in the 

graph and checking whether it is a feedback vertex set of size bounded by k. The 
algorithm never branches on a cycle of length greater than 13. It follows that 
the running time of the algorithm is 0{15^ . 

Theorem 3. The FVS problem on graphs of genus O(lgn) can he solved in 
time 

Let G be a graph with no vertices and genus go, and let e > 0 be given. We 
construct a graph G' as follows. Let IT be a wheel on vertices. Add W to 
G by linking a vertex in W other than the center to any vertex in G. It is easy 
to verify that 7 (G') = 7 (G) + 2, and that a feedback vertex set of G can be 
constructed from a feedback vertex set of G' easily. Since a wheel is planar, it 
follows that the genus g' of G is equal to g. The number of vertices n' of G' is 
n' = no + r?o'^ ■ Therefore g' = g < Uq < n''^ . This shows that we can reduce the 
FVS problem on general graphs to the FVS problem on graphs of genus 0{n’^) 
(for any e > 0) in polynomial time such that if the FVS problem on graphs of 
genus 0{n'^) can be solved in time, then so can the FVS problem on 

general graphs. In particular, if the FVS problem on graphs of genus 0(n'^) can 
be solved in time then so can the FVS problem on general graphs.^ 



JV’’-Minor Free Graphs 

Let r be a constant, and let Qr be the class of graphs with minimum degree at 
least 3 and no A''’-minor. Combining theorems from [20, 25] and [26] as described 
in [9, pl80], we have the following: There exists a constant c such that, for all r, 
each G G Qr has a cycle of length less than cr^lg r. 

Let G be a AT’’-minor free graph with minimum degree at least 3. According 
to the result described above, there is a cycle in G of length bounded by ci = 
cr^/lgr, which is a constant. Therefore, if we modify the algorithm FVS-solver 
so that it branches on an almost shortest cycle after applying Clean(G), then 
the algorithm always branches on a cycle of length at most c' = 1 + max{3, ci}.® 
This shows that the running time of the modified algorithm is 

Theorem 4. For any constant r, the FVS problem on K^-minor free graphs 
can be solved in time 0{c'^n^), where c' is a constant. 

^ Using the same technique, one can prove that if the FVS problem on graphs with 
average degree bounded by a constant, can be solved in time f{k)n^^^\ then so can 
the FVS problem on general graphs. 

® Note that if the length of the almost shortest cycle that the algorithm computes is 
not bounded by c', the algorithm rejects the instance. 
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Abstract. The splitting method is one of the most powerful and well- 
studied approaches for solving various NP-hard problems. The main idea 
of this method is to split an input instance of a problem into several 
simpler instances (further simplified by certain simplification rules), such 
that when the solution for each of them is found, one can construct 
the solution for the initial instance in polynomial time. There exists a 
huge number of articles describing algorithms of this type and usually 
a considerable part of such an article is devoted to case analysis. In 
this paper we show how it is possible to write a program that given 
simplification rules would automatically generate a proof of an upper 
bound on the running time of a splitting algorithm that uses these rules. 
As an example we report the results of experiments with such a program 
for the SAT, MAXSAT, and (n, 3)-MAXSAT (the MAXSAT problem 
for the case where every variable in the formula appears at most three 
times) problems. 



1 Introduction 

There are several approaches for dealing with NP-hard problems. For example, 
for some problems the algorithms that find approximate solutions in polynomial 
time are known, but there are also algorithms that find exact solutions with 
some probability. However, in practice many problems have to be solved exactly, 
so in this paper we consider the most popular type of algorithms, so-called split- 
ting algorithms. A typical example of a splitting algorithm for the propositional 
satisfiability problem (SAT) is DLL-type algorithm [DLL62]. 

Many NP-hard problems can be formulated in terms of Boolean formulas (for 
example, [GHNROO] shows how the NP-hard graph problem MAXCUT can be 
formulated in terms of Boolean formulas) . Due to this fact we consider splitting 
algorithms for problems dealing with formulas in CNF (e.g., SAT, MAXSAT). 
Let us briefly describe the idea of a splitting algorithm for such problems. At 
each step it firstly simplifies an input formula using simplification rules. Then, 
if the obtained formula F is not trivial, it makes two recursive calls for formulas 
resulting from F by setting I = true and I = false, where I is a literal of the 
formula F (these formulas are usually denoted F[l] and F[l]). Finally, it returns 
the answer according to the answers returned by the recursive calls. For example. 
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in case of the SAT problem it must return the answer “Satisfiable” if and only if 
at least one of the recursive calls returns this answer. In general case, a splitting 
algorithm may call itself on a bigger number of formulas. The only condition on 
these formulas is the possibility to construct the solution for initial formula from 
the solutions for these formulas (in polynomial time). 

There is a lot of literature where splitting algorithms for different problems 
are described. Moreover, for a number of problems there is a “hierarchy” of arti- 
cles containing such algorithms, where each paper improves the result of a pre- 
vious one by more involved case analysis or by introducing a new simplification 
rule. For example, there exist splitting algorithms for Satisfiability (see [DHIVOl] 
for survey). Maximum Satisfiability ([BR99], [CK02], [GHNROO]), Exact Satis- 
fiability ([BMS03], [K02]), Maximum Cut ([FK02], [GHNROO]), and many other 
problems. Usually the size of a case analysis in this hierarchy increases with the 
improvement of a result. But this is not always true: sometimes a new simplifi- 
cation rule allows to prove a better result by considering fewer cases. 

In this paper we show how it is possible to write a program that given 
simplification rules would automatically generate a proof of an upper bound 
on the running time of a splitting algorithm that uses these rules. We present 
several automatically generated proofs of upper bounds for the SAT, MAXSAT 
and (n, 3)-MAXSAT problems. 

Organization of the Paper. Our paper is organized as follows: In Sect. 2 we give 
basic definitions. Section 3 contains the detailed description of our program. In 
Sect. 4 we discuss related work. Section 5 contains further directions (in partic- 
ular there we indicate several bottlenecks of all currently known frameworks for 
automated proofs of upper bounds for NP-hard problems). 

2 General Setting 

2.1 Basic Definitions 

Let U be a set of Boolean variables. The negation of a variable v is denoted 
by u, by U we denote the set {u | u G U}. Literals are the members of the set 
W = V UV .li w denotes a literal T, then w denotes a literal 1. A clause is the 
disjunction of a finite set of literals that does not contain simultaneously any 
variable together with its negation. The empty clause is interpreted as False. A 
formula in CNF is the conjunction of a finite set of clauses. The empty formula 
is interpreted as True. The length of a clause is the number of its literals. The 
length of a formula is the sum of lengths of all its clauses. We say that a literal 
I occurs in a clause or in a formula, if this clause or this formula contains the 
literal 1. However, we say that a variable v occurs in a clause or in a formula, if 
this clause or this formula contains the literal v, or it contains the literal v. An 
{i, j)-literal is a literal that occurs in a formula i times positively and j times 
negatively. 

An assignment is a finite subset of W that does not contain any variable 
together with its negation. Informally speaking, if an assignment / contains a 
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literal I, it means that I has the value True in I. To obtain a formula F[I] 
from a formula F and an assignment / = {h, . . . ,ls}, we first remove all clauses 
containing the literals k from F, and then delete all occurrences of the literals 
li from the other clauses. 

2.2 Estimation of the Size of a Splitting Tree 

Kullmann and Luckhardt introduced in [KL98] a notion of a splitting tree. One 
can consider an execution of a splitting algorithm as a tree whose nodes are la- 
beled with CNF-formulas such that if a node is labeled with a CNF-formula F, 
then its sons are labeled with simplified formulas F[Ii], F[l 2 ], . . . , F[Ik] for as- 
signments Ii, I 2 , ■ ■ ■ , Ik- To each formula F of this tree we attach a non-negative 
integer fJ-(F), which denotes the complexity of F. We use the following measures 
of complexity in our program: 

1. /i(E) = N{F) is the number of variables in F; 

2. /i(F) = K{F) is the number of clauses in F; 

3. p,{F) = L{F) is the length of F. 

The tree is a splitting tree if, for each node, the complexity of the formula la- 
belling this node is strictly greater than the complexity of each of the formulas 
labelling its sons. 

Let us consider a node in our tree labeled with a formula Fg. Suppose its 
sons are labeled with formulas Fi, F 2 , . . . , F^- The splitting vector of this node is 
the TO-tuple {ti,t 2 , - ■ - ,tm), where F are positive numbers not exceeding /i(Fb) ~ 
p.{Fi). The characteristic polynomial of this splitting vector is defined as follows: 
h{x) = 1 — The only positive root of this polynomial is called the 

splitting number and denoted by r{ti,t 2 , ■ ■ ■ , tm)- The splitting number of a tree 
T is the largest of the splitting numbers of its nodes. We denote it by Tmax,T- 

Now we have provided all the necessary definitions to present a lemma proved 
by Kullmann and Luckhardt in [KL98] that allows to estimate the number of 
leaves in a splitting tree using its splitting number. 

Lemma 1. Let T be a splitting tree, and let its root be labeled with a formula 
Fq. Then the number of leaves in T does not exceed ■ 

Let us show how this lemma helps to estimate the running time of a splitting 
algorithm. As we already mentioned a splitting algorithm at first simplifies the 
input formula and then makes several recursive calls of itself for formulas with 
smaller complexity. Clearly, the total running time of such an algorithm is the 
total running time of all recursive calls plus the time spent to make these calls 
(note that we consider only splitting algorithms that at each step make a number 
of recursive calls bounded by a constant). Therefore, the running time is within 
a polynomial factor of the number of nodes (or leaves) of the recursion tree. 
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2.3 SAT, MAXSAT, and (n, 3)-MAXSAT Problems 

In this subsection we give definitions of NP-hard problems on which we tested 
our program. 

The propositional satisfiability problem (SAT) is: given a formula in CNF, 
check whether there exists an assignment of Boolean values to all variables of 
this formula that satisfies all its clauses. In the maximum satisfiability problem 
(MAXSAT) the question is to find the maximum possible number of simultane- 
ously satisfied clauses in a given CNF formula. The (n, 3)-MAXSAT problem 
is the version of MAXSAT, where each variable appears in the input formula at 
most three times. 

Currently best upper bounds for these problems are the following: 

- SAT: 0(1.238823^), 0(1.073997^) ([HOO]), 

- MAXSAT: 0(1.341294*^) ([CK02]), 0(1.105729*) ([BR99]), 

- (n,3)-MAXSAT : 0(1.324719*") ([BR99]). 

All these bounds are obtained by using the splitting method. Note that there are 
no known upper bounds of the form 0(c*"), where c < 2 is a constant, for SAT 
and MAXSAT problems. We do not know any upper bounds for (n, 3)-MAXSAT 
w.r.t. number of clauses and length of the formula that improve the correspond- 
ing bounds for MAXSAT. 

3 The Algorithm 

We start with giving several non-standard definitions that are convenient in the 
context of this paper and then present the algorithm on which our program is 
based. 

A typical splitting algorithm first simplifies an input formula and then splits 
the resulting formula. Usually it makes splittings according to some properties 
of a formula. That is, in a code of a splitting algorithm the application of sim- 
plification rules to the input formula is followed by several cases of the form as: 
if a formula contains such clauses or literals . . . , then split as .... To prove the 
correctness of such an algorithm one has to prove that each simplified formula 
(we say that a formula is simplified if no simplification rule is applicable to it) 
satisfies the condition of at least one of these cases. Note that a set of formu- 
las satisfying the condition of a case corresponds to each such case, and the 
algorithm is correct if the union of all these sets contains the set of simplified 
formulas. In the following subsection we introduce a notion of a class of formulas 
which is convenient for describing splitting algorithms. 



3.1 Class of Formulas 

Let C be a clause consisting of literals li,l 2 , ■ ■ ■ Jk- We define a clause with 
unfixed length as a set of clauses containing all these literals and probably some 
more literals and denote it by {li V I 2 'd ■ ■ ■ 'd h ■ ■ ■)• The literals I 1 J 2 , ■ ■ ■ ,lk we 
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call the basis of this clause. For example, {I ■ ■ ■) is a set of all clauses containing 
the literal 1. In the following we use the word “clause” to refer both to clauses 
in its standard definition and to clauses with unfixed length. 

Similarly we define a class of formulas. Let C( , . . . , be clauses (some of 
them may have unfixed lengths). Then a class of formulas is a set of formulas 
represented as Ci A ... A Cm , such that the following conditions hold: 

1. m > k, 

2. for I < i < k, Cl C Ci (as sets of literals), if C' is a clause with unfixed 
length, and Ci = Cl otherwise, 

3. for 1 < i < k, k + 1 < j < m, Cj does not contain any variable from the 
basis of C'. 

We denote this set by A . . . A . . . and call the clauses , . . . , the basis of 
this class of formulas. We say that variables from clauses C(, . . . , C(, are known 
for this class. When we say that a class of formulas contains some literal we 
mean that its basis contains this literal. 

Informally speaking, one can obtain a class of formulas by replacing all oc- 
currences of some variables in a formula by “. . . ” . For example, the formula 
{x V y V z) A (x) A {y V z V u) A {uV x) A (z) is a member of the following class 
of formulas: {z . . .) A {z W u . . .) A {u . . .) A (z) . . .. 

Note that in most situations we can work with a class of formulas in the 
same way as with a CNF formula. For example, if we eliminate from the basis of 
a class of formulas all clauses containing literal x and all occurrences of literal 
X from the other clauses, we obtain the class of formulas resulting from all 
formulas of the initial class by setting the value of a; to T rue. Also it is easy to 
see that if after assigning a Boolean value to a variable of a class of formulas 
or applying a (considered in this paper) simplification rule to it, its complexity 
measure decreases by A, then the complexity measure of each formula of this 
class decreases at least by A. 

For a class of formulas T and a clause C we define a class of formulas iF+{C} 
as a class resulting from T by adding clause C to its basis. Similarly we define 
the clause with unfixed length C + {1}, where C is a clause with unfixed length 
and I is a literal. By fixing the length of the clause (?i V . . . V . . .) we mean 
replacing it by the clause (?i V Z 2 V . . . V Zfc)- 

We say that a simplification rule is applicable to a class of formulas, if this 
rule is applicable to every formula of this class. For example, each formula of 
the class (x) A {xV y . . .) A {y . . .) . . . contains pure literal (i.e., a literal that does 
not occur negated in the formula). Similarly, we say that a class of formulas 
has a splitting if each formula of this class has this splitting. For example, every 
formula of the class {xV y) A {xV yV i . . .) A {yV u) A {uW z) A {xV t) A {uV yV z) . . . 
always has the (1, 2)-splitting (on variable x) w.r.t. the number of variables (t 
is the other variable, which is eliminated), the (2, 2)-splitting (on y) w.r.t. the 
number of clauses, and (4, 3)-splitting (on z) w.r.t. the length of the formula. 
However, when we deal with specific problem we can find even better splittings 
(i.e. with smaller splitting numbers) since simplification rules can reduce the 
formula after splitting. 
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3.2 Simplification Rules 

We say that simplification rule is applicable to a formula F if it can replace F 
by a formula F' in polynomial time, so that both following conditions hold: 

— the complexity of F' is smaller than the complexity of F, 

— the solution for F can be constructed from the solution for F' (in polynomial 
time). 

We say that simplification rule is applicable to a class of formulas if it is appli- 
cable to every formula of this class. 

We use the following simplification rules in our program {F denotes a class 
of formulas) : 

1. Pure literal. If F contains a (fc, 0)-literal l^ replace F with F[l\. Used for all 
considered problems. 

2. Unit clause. If (1) G F, replace F with F[l]. Used for SAT only. 

3. Resolution. By elimination from F a variable x by resolution we mean adding 
all resolvents by a; to iF and elimination of all clauses containing a; or a; 
from F (for definition of the resolvent see, for example, [HOO]). This rule 
eliminates by resolution from F all (1, l)-variables in case of MAXSAT and 
(n, 3)-MAXSAT , and all (1, k)- and (2, 2)-variables in case of SAT. 

4. Dominating unit clause. If F contains a literal I, such that the literal I occurs 
in F not more times than the literal I occurs as a unit clause, then replace 
F with F[l]. Used for MAXSAT and (n, 3)-MAXSAT . 

5. Closed subformula. If F contains a closed subformula F' , replace F with 
F — F' . Used for all considered problems. 

6. Almost dominating unit clause. If F contains a literal I, such that I occurs in 
F as a, unit clause k times and I occurs in /c-|- 1 clauses, such that two of them 
contain a pair of complementary literals (it means that at least one of these 
two clauses is always satisfied), replace F with F[V\. Used for MAXSAT and 
(n, 3)-MAXSAT . 

7. Almost common clauses. If F contains two clauses Ci, with fixed lengths 
such that for a literal I C\ — {?} = C 2 — {?}, replace them with a clause 
Cl - {1}. Used for MAXSAT and (n, 3)-MAXSAT . 

8. Satisfying assignment. If there is an assignment for all known variables of 
F satisfying all clauses of the basis of F , then reduce F w.r.t. this assign- 
ment (i.e., assign the value True to all literals of assignment). Used for all 
considered problems. 

All these rules except for the last one can be found in [BR99], [GHNROO], 
and [HOO] , the correctness of the last rule is trivial. 

3.3 Implementation and Data Structures 

In this subsection we briefly describe the implementation of the main procedures 
and data structures of our program. 
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A clause is represented as a list of literals and a flag IsLengthFixed. When 
this flag is set, no literal can be added to a clause. This flag is not set by default, 
and a clause is interpreted as a clause with unfixed length. The operations on a 
clause are fixing its length and adding/deleting a literal. 

A simplification rule is implemented as a procedure that takes a class of 
formulas as an input. It checks whether its own condition holds for the input class 
of formulas, and if it does, it makes corresponding changes in it. For example, 
the pure literal rule checks whether a class of formulas contains {k, 0)-literals 
and assigns the value True to all such literals. 

A class of formulas is represented simply as a list of clauses. The main oper- 
ation on a class of formulas is assignment of a value to a literal. To assign the 
value True to a literal I we remove from a class of formulas all clauses containing 
the literal I and delete all occurrences of the literal 1. 

Another important operation on a class of formulas is finding its possible 
splittings. Our program considers the following splittings (IF is a class of formu- 
las, x,y are literals): 

1. T[x],T[x\, 

2. T[x,y\,T[x,y\,T[x\] 

3. T[x,y\,T[x,y\,T[x,y\,T[x,y\. 

After the construction of the classes of formulas in one of the cases above our 
program applies all simplification rules (given to it as an input) to all these 
classes as long as at least one of them is applicable. And then it calculates the 
resulting splitting number according to the given measure. 



3.4 Algorithm 

The main goal of our program is to prove an upper bound on the running time 
of a splitting algorithm of a form as given in Fig. 1. 

Given a set of simplification rules S, a measure of complexity fx and a number 
SplitNum, our program tries to prove that such an algorithm can always make 
a splitting with a splitting number not exceeding SplitNum. Thus, it tries to 
prove an upper bound 0{SplitNum^) for this algorithm. 

The main part of our program is implemented in the procedure AnalyzeCase, 
which is given in Fig. 2. Informally speaking, this procedure tries to And a good 
splitting for the input class of formulas. If such a splitting is found, it returns, 
otherwise it constructs several new classes of formulas such that their union 
contains the input class of formulas and recursively calls for them. It is easy 
to see that if this procedure terminates on a given class of formulas, then all 
formulas of this class have a good splitting. 

At the first step the procedure AnalyzeCase checks whether at least one 
simplification rule from the set S is applicable to T and returns if such a simpli- 
fication rule exists. This is motivated by the fact that a splitting algorithm (of 
the form described above) makes splittings only on already simplified formulas. 
At step 2 the procedure tries to And a good splitting (i.e., a splitting with a 
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Procedure SplittingAlgorithm 
Input: 

formula F, set of simplification rules S, measure of complexity fi. 

Method 

1. apply all simplification rules from the set 5 to P as long as at least one of them is 
applicable 

2. if the solution for F can be constructed in polynomial time (in particular, if F is 
empty), return the answer 

3. for each pair of variables x and y consider the following splittings: 

(a) F[x],F[x] 

(b) F[x,y],F[x,y],F[x] 

(c) F[x,y],F[x,y],F[x,y],F[x,y] 

choose the one that provides the best splitting number (w.r.t. y) 

4. construct the formulas corresponding to the splitting found at the previous step 
and call SplittingAlgorithm for each of them 

5. return the answer according to the answers returned by the recursive calls 



Fig. 1. Form of splitting algorithms. 



splitting number not exceeding Split Num) for the input class of formulas and 
returns if such exists. 

At step 3 AnalyzeCase simply selects a clause with unfixed length. Note 
that the situation when there is no such a clause is impossible, since otherwise 
all known variables of IF form a closed subformula and hence the corresponding 
rule is applicable. 

At steps 4 and 5 the procedure considers two cases: when the selected clause 
contains at least one more literal and when it does not. In some sense, it splits 
a class of formulas for which it cannot find a good splitting into several classes 
and recursively calls for them. The goal of these two steps is to construct the 
classes of formulas such that their union contains IF. 

We use the relation a at step 5 to define the set of pairs (i,j), such that an 
(f, j)-literal does not give immediately a good (i.e., not worse than SplitNum) 
splitting. Clearly, for any problem this relation may be defined as follows: 

a{K, SplitNum) = {(i,j) : T{i,j) > SplitNum}, 
a{L, SplitNum) = {{i,j) ■ T{i + j,i + j) > SplitNum}, 

where K is the number of clauses and L is the length of the input formula. These 
sets are finite due to the properties of splitting numbers [KL98] . Unfortunately, 
we cannot define such a finite set when we deal with the number of variables 
as a complexity measure. But in case of the (n, 3)-MAXSAT problem we can 
write the following: 

a{N, SplitNum) = {{i,j) ■ i + j < 3}, 
where N is the number of variables. 






256 



Sergey S. Fedin and Alexander S. Kulikov 



Note that at step 3 the procedure can select any clause with unfixed length, 
but, in practice, the size of the resulting case analysis (when we write down 
all the classes of formulas processed by this procedure) depends on the selected 
clause. Current version of our program selects this clause in the following way: 
for each clause with unfixed length it constructs classes of formulas as described 
at steps 4 and 5 and counts the number of bad classes among these constructed 
classes (here we say that a class is bad if it does not satisfy the conditions of the 
steps 1 and 2). And finally, the clause providing a minimal number of bad classes 
is selected. In most situations this heuristic allowed to produce smaller proofs 
than any other one considered by us. However, this heuristic is obviously not the 
best one: for example, it is possible to count bad classes out of bad classes. But 
when we tried this heuristic the running time of our program increased greatly. 
So, we are going to think about smart heuristics. 



Procedure AnalyzeCase 
Input: 

class of formulas T, set of simplification rules S, number SplitNum > 1, measure of 
complexity y. 

Method 

1. if any simplification rule from the set S is applicable to T, then return 

2. if there is a splitting w.r.t. y, m T providing splitting number not exceeding 
SplitNum, then return 

3. select from IF according to some heuristic a clause with unfixed length; call this 
clause C 

4. (a) C7i = C 

(b) fix length of Ci 

(c) call AnalyzeCase(iF+{Ci} — {C}) (all other input parameters of the procedure 
AnalyzeCase are the same here) 

5. (a) introduce new variable to F; call it u 

(b) C = C + {u} 

(c) for all i,j > 0, such that {i,j) € a{y, SplitNum) 

i. construct the set of all possible classes of formulas resulting from F by 
adding occurrences of the variable u for u being an (i, j)-variable 

ii. call AnalyzeCase on each constructed class of formulas 



Fig. 2. Procedure AnalyzeCase. 



3.5 Automatically Proved Upper Bounds 

Our program proved the following upper bounds: 

- SAT: 0(1.272021*'), 0(1.101027*), 

~ MAXSAT: 0(1.37226*^), 0(1.135889*), 

- (n,3)-MAXSAT : 0(1.2852*"), 0(1.236507*'), 0(1.098267*). 
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The output files proving these bounds can be found at 

http : / /logic . pdmi . ras . ru/ ~kulikov (also there are several proofs of simpler 
bounds given). The structure of these files allows to verify each step of our 
program. Note that a big number of cases in all presented proofs does not mean 
that the corresponding algorithms are sophisticated. (Actually the form of all 
these algorithms is given in Fig. 1.) 

4 Related Work 

Independently from us, another two frameworks for automated proofs of upper 
bounds were implemented. First of them, by Nikolenko and Sirotkin [NS03], was 
developed to prove upper bounds for the SAT problem and it uses only one 
simplification rule, namely pure literal rule. It was proved by this framework 
that SAT can be solved in 0(1.56639'^). Another framework was developed 
by Gramm et al. [GGHN03]. In this framework several new bounds for hard 
graph modification problems were proved. In particular, an upper bound for the 
Gluster Editing problem was improved to 0(1.92^) (where k is the number of 
edge modifications allowed) . The program of Gramm et al. allows to work with 
different simplification rules. 

In general, all contemporary frameworks (including ours) are based on the 
following simple observation: it is possible to prove an upper bound for a problem 
just by considering all possible subformulas of the size bounded by a constant. 
The framework of Gramm et al. is, in some sense, a straightforward implemen- 
tation of this fact: their program given a number s constructs a set S of graphs 
each having s vertices such that any possible graph has at least one element of 
the set 5 as a subgraph. For each graph from the set S all possible splittings are 
enumerated and the one providing the best splitting number is selected. And as 
a result, an upper bound corresponding to the worst of these selected splittings 
is proved for a problem. Glearly, the resulting upper bound is as better, as the 
input number s is larger. But with the increasing of s the number of all possible 
splittings for a graph on s vertices increases greatly (as well as the size of 5). 

The differences of our approach from the one by Gramm et al. are the fol- 
lowing: 

1. Our program is given a bound first, and then it tries to to prove this bound, 
while the program by Gramm et al. given a number s tries to prove as better 
bound as possible by considering graphs on s vertices. 

2. The set of all possible small subformulas in out program is constructed in the 
process of proof search , while the construction of similar set is the first stage 
of the program by Gramm et al. Thus, we never know whether our program 
will terminate on a given bound. And also, it is impossible to say in advance 
for which number s (or whether such number s exists) the framework by 
Gramm et al. can prove a given bound. 

3. Our framework do not consider all possible splittings for each class of for- 
mulas (actually it considers only splittings on assignments having not more 
than two literals), as framework by Gramm et al. does. 
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5 Further Directions 

The first direction is thinking about heuristics of quick finding of good split- 
tings for small formulas (or graphs). For example, authors of [GGHN03] indicate 
that in most cases almost all the time (of their program execution) is spent for 
concatenation of splitting vectors. 

Another direction is to think about construction of the set of all possible 
small subformulas (or subgraphs). Glearly, such a set is not unique (for exam- 
ple, different sets may be constructed by selecting different clauses at step 3 of 
AnalyzeCase) and the resulting upper bound may depend on this set. 

All known frameworks do not provide a tool for converting their output files 
into splitting algorithms. The advantage of such an algorithm is in fact that it 
does not have to spend time for finding good splittings and simplifications, since 
all this information may be given in the output file. Thus, it seems like such an 
algorithm could work better than a simple splitting algorithm. 

The currently best algorithm for SAT [HOO] uses the following simplification 
rule: If each clause of a formula that contains a (2, 3“*')-literal contains also a 
(3“*', 2)-literal, then all (3“*', 2)-literals can be assigned the value True. Glearly, 
this rule cannot be added to any known framework, since these frameworks 
consider only formulas (graphs) of the size bounded by a constant. So, it will be 
interesting to somehow introduce this rule into existing frameworks. 

Another non-standard trick is used in currently best algorithm for MAXSAT 
[GK02]: at each iteration it preserves a certain invariant on the input formula. 
We are going to add this feature into our framework. 
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Abstract. As our main result, we give an 0((2.415)*^n") algorithm for 
finding a feedback arc set of weight at most fc in a tournament on n ver- 
tices with each vertex having weight at least 1. This improves the pre- 
viously known bound of 0(\/fc logn) for the problem in unweighted 
tournaments. Here u> is the exponent of the best matrix multiplication 
algorithm. 

We also investigate the fixed parameter complexity of weighted feedback 
vertex set problem in a tournament. We show that (a) when the weights 
are restricted to positive integers, the problem can be solved as fast as the 
unweighted feedback vertex set problem, (b) if the weights are arbitrary 
positive reals then the problem is not fixed parameter tractable unless 
P — NP and (c) when the weights are restricted to be of at least 1, the 
problem can be solved in 0((2.4143)*^n") time. 



1 Introduction and Motivation 

Given a directed graph on n vertices and an integer parameter k, the feedback 
vertex (arc) set problem asks whether the given graph has a set of k vertices 
(arcs) whose removal results in an acyclic directed graph. In the weighted version 
of the problem we are given non negative weights on vertices (arcs) and the 
problem asks whether the graph has a set of vertices (arcs) of weight at most 
k, whose removal makes the graph acyclic. While these problems in undirected 
graphs are known to be fixed parameter tractable [10] (in fact the edge version 
in undirected graphs can be trivially solved), the parameterized complexity of 
these problems in directed graphs is a long standing open problem in the area. 
In fact, there are problems on sequences and trees in computational biology, that 
are related to the directed feedback vertex set problem [6] . 

In this paper, we consider these problems in the special class of directed 
graphs, tournaments. A tournament T = (V, E) is a directed graph in which 
there is exactly one directed arc between every pair of vertices. We give effi- 
cient fixed parameter tractable algorithms for the feedback arc set and weighted 
feedback vertex set problem in tournaments under the framework introduced by 
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Downey and Fellows [4]. Weighted feedback arc set problem in tournaments finds 
application in rank aggregation methods. Dwork et.al [5] have shown that the 
problem of computing the so called Kemeny optimal permutation for k full lists, 
where k is an odd integer, is reducible to the problem of computing a minimum 
feedback arc set problem on a weighted tournament with weights between 1 and 
k-2. 

The feedback vertex set problem in a tournament is a set of vertices that hits 
all the triangles in the tournament. But for the feedback arc set problem (FAS) 

this is not the case. In [11], an 0{Vk n^logn) time algorithm, where to is the 
exponent of the best matrix multiplication algorithm, was developed for FAS 
using the fact that a directed graph with at most k arcs away from a tournament 
has a cycle of length at most Vk. In this paper, we use a different characterization 
of a minimal feedback arc set in directed graphs to develop new algorithms for 
the feedback arc set problem in a tournament. In Section 2, we first show that 
if a subset F of arcs forms a minimal feedback arc set in a directed graph then 
the graph formed after reversing these arcs is acyclic. Such a characterization 
helps us to maintain the tournament structure (since in every recursive step we 
reverse but not delete arcs). We first apply this characterization to develop an 
algorithm for FAS in tournaments taking 0(3^n“) time. We then improve this 
by using a simple branching technique along with the new characterization to 
obtain an 0((2.415)^n“) time algorithm. We also observe that the algorithm, 
and hence the bound, applies for the FAS problem in weighted tournaments as 
well, where weights on the arcs are at least 1. 

In section 3, we consider the weighted version of the feedback vertex set 
problem in tournaments that is known to be fVP-complete[12]. 

As observed before, the feedback vertex set problem in an unweighted tour- 
nament can be written as a 3-hitting set problem and hence we can apply the 
algorithm of [9] to get an 0((2.27)^ -I- n^) algorithm as mentioned in [11]. How- 
ever, this algorithm uses some preprocessing rules which don’t naturally gener- 
alize for the weighted hitting set problem. We consider the following variants of 
the weighted feedback vertex set (WFVS) problem: 

1. Integer- WFVS, where the weights are arbitrary positive integers, 

2. Real- WFVS, where the weights are real numbers > 1, and 

3. General- WFVS, where the weights are positive real numbers. 

We show that the Integer- WFVS and Real- WFVS are fixed parameter tracta- 
ble but General- WFVS is not fixed parameter tractable (FPT) unless P = NP. 
More specifically, we show that the Integer- WFVS can be solved as fast as the 
feedback vertex set problem in an unweighted tournament, which currently has 
running time of 0((2.27)^ -I- n^) [9], and that Real-WFVS can be solved in 
0((2.4143)^n“) time. 

In section 4, we conclude with some remarks and open problems. Throughout 
this paper, by logn and to, we mean, respectively, the logarithm to the base 2 
of n and the exponent of the running time of the best matrix multiplication 
algorithm. By rev(x), where x = (u,v) is an arc of a directed graph, we mean 
the arc (v, u). 
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2 Feedback Arc Set Problem 

Feedback arc set problem is known to be NP-complete for weighted tournaments 
[5] but is still open for unweighted tournaments. It is believed to be NP-complete. 
In this section, we give two fixed parameter tractable algorithm for the feedback 
arc set problem in a tournament. Both of these algorithms are based on the fact 
that a subset of arcs forms a minimal feedback arc set only if the original graph 
becomes acyclic when the direction of these arcs are reversed. The first algorithm 
is a direct application of this and achieves the time bound of 0(3^n“). We 
further improve this to 0((2.415)^n‘^) using a new branching technique along 
with the reversal characterization of minimal feedback arc set. We conclude this 
section by observing that some of these algorithms are applicable for weighted 
tournaments as well as for dense directed graphs. 

2.1 Main Algorithm 

Though some authors use the characterization below as the definition, at least 
for the minimum feedback arc set [8] , we could not find this precise statement (of 
the lemma) in the literature. We give a proof here for the sake of completeness. 

Lemma 1. Reversal Lemma 

Let G = (y, E) he a directed graph and F be a minimal feedback arc set (FAS) 
of G. Let G' he the graph formed from G by reversing the arcs of F in G, then 
G' is acyclic. 

Proof. Assume to the contrary that G' has a cycle G. Then C can not contain 
all the arcs oi E — F, as that will contradict the fact that F is a FAS. Define 
the set rev{F) = {(m, w) | {v,u) G F}. Let C n ren(F) = {/i, / 2 , • ’ ’ > /fe} and 
a = rev{fi). Then the set {ei, C 2 , • • • , Cfe} is a set of arcs of G which are reversed 
and are part of C. Now since each Cj G F, and F is minimal, there exists a cycle 
Ci such that F C\ Ci = {ci}. Now consider the directed graph L induced by the 
arcs of {C, Ci, • • • , Gk} — F — rev{F).. It is clear that L is a directed closed walk 
with all the arcs in the original graph G. In fact, if Vf, Q n C = 0, then F is a 
simple cycle in G, such that Ln F = I!), contradicting the fact that F is a FAS. 
If L is not a simple cycle then we can extract a simple directed cycle from it not 
having any arcs of F, violating the definition of F and that completes the proof. 

□ 

Now we use Lemma 1 to give a simple algorithm for the feedback arc set 
problem in a tournament. 

Algorithm TFAS(T,/c, F)(*T is a tournament, k > 0, and F is a set of arcs.*) 
(Returns a minimal feedback arc set of size at most k, if exists and returns 
‘NO’ otherwise. F is a local variable containing arcs of a partial feedback arc 
set that are reversed from the original T. Initially the algorithm is called by 
TFAS(F,/c,0)) 
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Step 0: If T does not have a directed triangle and k > 0, then answer YES and 
return F. 

Step 1: If /c = 0 and T has a triangle, then answer NO and EXIT. 

Step 2: Find a triangle in T and let {a, b, c} be the arcs of the triangle. 

Step 2a: If rev{a),rev{b) and rev{c) are in F, then answer NO and EXIT. 
Step 2b: If TFAS{T\{x} U rev{x}, fc — 1, E U {a:}) is true for any arc x of 
the triangle such that rev(x) is not in F, then answer YES and return 
F and exit. Otherwise return NO and exit. 

We will need the following easy observations to prove the correctness and the 
runtime of the algorithm. 

Lemma 2. [1] A tournament T = {V,E) has a directed cycle if and only if it 
has a directed triangle. 

Let M be the adjacency matrix of the tournament T. Then T has a directed 
triangle if and only if for some i,j such that 1 < i < j < n, > 1 

and M[j,i] = 1. This can be determined in 0{n^) time. If such a pair (i,j) 
exists, then there exists a k such that M[i,k] = M[k,j] = I which can also 
be determined in 0(n) time. Such a triple {i,j, k} forms a triangle. Further, T 
has a directed cycle of length 4 if and only there exists a pair (z, j) such that 
^ ^ i < j < n, M^[z, j] > 1 and M^[j, z] > 1. If such a pair exists, then as before, 
the witness 4-cycle can also be found in 0{n) time. So we have 

Lemma 3. Let T be a tournament on n vertices. Then we can find a directed 
triangle or a directed cycle of length 4 in T, if it exists, in 0{n‘^) time. 

Theorem 1. Given a tournament T = (V, E) on n vertices, we can determine 
whether it has a feedback arc set of size at most k in 0(3^rz‘^) time. 

Proof. First we will show that the algorithm TEAS finds a minimal feedback arc 
set of size at most k if exists. Since T has a FAS of size at most k if and only if it 
has a minimal FAS of size at most k, then the theorem will follow. Correctness 
of Step 0 and Step 1 follow from Lemma 2. Step 2a answers correctly as by 
Reversal Lemma, the current F can not be extended to a minimal feedback arc 
set of G. In Step 26, we branch on each arc x of the triangle such that rev{x) ^ F, 
because if none of these arcs is picked in the feedback arc set of G, then this 
triangle will survive in G' , obtained by reversing the arcs of F . But then by 
Reversal Lemma, this F is not minimal. So this proves the correctness of the 
algorithm. 

The claimed time bound can easily be seen by observing the fact that k 
decreases at every recursive Step 26 by 1. So the recursion depth is at most k. 
The branching factor at every recursion step is at most 3 and hence by Lemma 3, 
we have the desired time bound for the algorithm. □ 

This already improves the 0{vk n'^logn) time algorithm of [11] for the 
problem. Now we further improve the bound by a new branching technique. 
Lemma 1, and the following lemma. 
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Lemma 4. Let T = {V,A) he a tournament that eontains no induced subgraph 
isomorphic to F\ (see figure 1). Then the minimum feedback vertex and arc set 
problems are solvable in T in 0(nF) time. 

Proof. It is easy to see that a tournament has a subgraph isomorphic to Fi if 
and only if it has a directed 4-cycle. It is also an easy exercise to see that if a 
tournament has no subgraph isomorphic to Fi , then no pair of directed triangles 
in the tournament has a vertex in common. Hence the minimum feedback vertex 
or arc set is obtained by finding all triangles, and picking a vertex/arc from each 
of them. 

Finding all triangles in such a tournament can be done in 0(nP) time as 
follows. First compute the square of the adjacency matrix of the tournament. 
Since the tournament can have at most n/3 triangles, there can be at most n/3 
pairs (i,j) such that I < i < j < n and M‘^[i,j] > 1 and M[j,i] = 1. For each 
such pair, the corresponding witness triangle can be found in 0(n) time. 

□ 




Fig. 1. Fi 



Algorithm BTFAS(T,fc, F){*T is a tournament, k > 0, F is a, set of arcs*) 
(Returns a minimal feedback arc set of size at most k, if exists and returns 
‘NO’ otherwise. F is a local variable containing arcs of a partial feedback arc 
set that are reversed from the original T. Initially the algorithm is called by 
BTFAS(T, fc, 0)) 

Step 0: If T does not have a directed triangle, then answer YES and return F. 
Step 1: If A: = 0 and T has a triangle, then answer NO and EXIT. 

Step 2 : Find an induced subgraph on 4 vertices isomorphic to Fi, if exists, in 
T. Such a subgraph is simply a tournament on 4 vertices having at least two 
directed triangles. Let the vertex set of such an Fi be {1,2, 3, 4} and the 
adjacencies be as in figure 1 (in particular (1,2) is the only arc not part of 
any directed triangle). If no such subgraph exists in T, then go to Step 6. 
Step 3: Let {a,b,c} be the arcs of a triangle in Fi, such that there exists an 
arc X G {a, b, c} for which rev{x) G F. If there is no such triangle in Fi, then 
go to Step 4. 
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Step 3a: If rev{a),rev{b) and rev{c) are in F, then answer NO and EXIT. 
Step 3b: If BTFAS{T\{x} U rev{x),k — 1, E U {a;}) is true for any arc x 
of the triangle such that rev{x) is not in F, then answer YES and exit 
after returning F] else answer NO and exit. 

Step 4: If rev{{l,2)) ^ F then if any of the following recursive calls returns 
true, then answer YES and return the corresponding F and exit, and answer 
NO and exit otherwise. 

In the following, T' is obtained from T by reversing the ‘newly included’ arcs 
of E. 

1. BTFAS{T',k-l,F\j{{2,,A))), 

2. BTFAS{T', fc - 2, F U {(4, 1), (4, 2)}), 

3. BTFAS{T', fc - 2, F U {(4, 1), (2, 3)}), 

4. BTFAS{T', fc - 2, F U {(1, 3), (2, 3)}), 

5. BTFASF{T', k-3,FU {(1, 2), (1, 3), (4, 2)}) 

Step 5 : If rez)((I, 2)) G F, then branch only on the first 4 cases enumerated in 
Step 4. 

Step 6: Solve the problem in polynomial time for the resultant tournament 
using Lemma 4. 



Note that all the induced subgraphs on 4 vertices with at least two directed 
triangles are isomorphic to Fi. In the above algorithm at every step, we first find 
a graph isomorphic to Fi , and then if there exists a directed triangle in Fi with 
all its arcs included in the partial feedback arc set (F) obtained so far, then we 
apply Lemma 1 and answer NO. Otherwise we branch on all the arcs x of the 
triangle such that rev{x) ^ F as by Lemma 1 at least one such arc must be part 
of F. 

If none of the arcs of Fi is part of F, then we branch on all possible minimal 
feedback arc sets of Fi . The only remaining case is when all the arcs x appearing 
in some triangle in F\ are not in F but reu((l, 2)) G F. In this case. Lemma I 
implies that item 5 of Step 2b is not applicable (because the set {(1,3), (4, 2)} is 
not a minimal FAS of Fi). So when we reach Step 6 of the above algorithm, all 
the induced subgraphs on 4 vertices have at most one triangle. And the problem 
now can be solved in polynomial time by Lemma 4 . 

The following lemma follows from Lemma 3 and the fact that a tournament 
has a subgraph isomorphic to Fi if and only if it has a 4 cycle. 

Lemma 5. Given a tournament, a subgraph isomorphic to F\ (see figure 1) can 
be found in 0(nS) time. 



Thus, we get the following recurrence for the time complexity of the algo- 
rithm: 



T{n, k) < max 



2T(n, A: — 1) -I- 0(n‘^) or 

T(n, k-l) + 3F(n, k - 2) + T{n, k - 5) + 0{n‘^) 



The above recurrences solve to 0((2.415) n‘^). So we get the following theo- 



rem. 
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Theorem 2. Given a tournament T = {V^E), we can determine whether it has 
a feedback arc set of size at most k in 0((2.415) n‘^) time. 

We remark that the above algorithm can also be applied for weighted feed- 
back arc set problem in a tournament where the weight of every arc is at least 
1. The only modification is in the last step application of Lemma 4 where we 
choose the minimum weight arc from each triangle. Hence we have 

Theorem 3. Given a tournament T = {V,E), and a weight function tt : E ^ 
such that 7r(e) is at least 1 for every e € E, we can determine whether T 
has a feedback arc set of weight at most k in 0((2.415) n‘^) time. 

2.2 Feedback Arc Set Problem in Dense Directed Graphs 

In this section, we show that the feedback arc set problem is fixed parameter 
tractable for directed graphs which are at most arcs away from a tour- 

nament. We need following lemma to show the desired result. 

Lemma 6. [2] Let G = {V, E) be a strong directed graph with n vertices, m arcs 
and let I > 2. Then if m> ” -e(3-2i^)w-i-(; -i) ^ then the girth of the graph (g{G) ) 
is bounded by 1. 

Theorem 4. Let G be a directed graph with n vertices and m > ( 2 ) — 

arcs. Then the feedback arc set (F AS ) problem is fixed parameter tractable for 

G. 

Proof. For the feedback arc set problem, we can assume without loss of gen- 
erality, that the given directed graph is a strongly connected directed graph. 
(Otherwise, try values up to k in each strongly connected subgraph and take the 
minimum.) 

Then Lemma 6 implies that if a strong directed graph is at most n{g — 2)/2, 
ie. 0{ng), arcs away from a tournament, then its girth is bounded by g, as 

/n\ _ -k (3 - 2g)n -k (g^ - g) ^ 2n{g - 2) -k g - 

yi ) 2 “2 

and 

2n(g-2)+g-g^ ^ g^^ n(g-2) 

2 - ’ 2 ~ 2 

for n > y /{k — 2) and 3 < g < n/2. Hence if the strong directed graph is at most 
j^i+o(i) arcs away from a tournament, then its girth is bounded by So we 

find the shortest cycle in G and then by applying Lemma 1, we branch on each 
arc by reversing the arc. This way we don’t delete any arc and hence at every 
recursive step Lemma 6 ensures a cycle of length at most So we have an 

algorithm for feedback arc set problem in G which takes Cai 

and Judes [3] have observed that is bounded by f{k)n^^^\ where / 

is some function of k, for every fixed n and k. Hence it follows that the feedback 
arc set problem is fixed parameter tractable in G. 

□ 
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3 Feedback Vertex Set in Weighted Tournaments 

In this section, we show that the Integer-WFVS and Real-FVS are in FPT, 
while General-FVS is not in FPT unless P = NP. In fact we show that Integer- 
WFVS can be solved as fast as feedback vertex set problem in an unweighted 
tournament, which currently has a running time of 0((2.27)^-|-n^). Then we show 
that the Real-WFVS can be solved in 0((2.4143)^n^) using a simple branching 
technique. 

3.1 Integer-WFVS 

Now we will show that Integer-WFVS can be reduced to unweighted feedback 
vertex set problem in a tournament via a simple parameterized many-one reduc- 
tion. 

Theorem 5. Integer — WFVS can he solved in the same time as the unweighted 
feedback vertex set in a tournament, up to an additive term polynomial in n and 
k, ie., 0((2.27)'= -b (M^)- 

Proof. We can safely assume that all the weights on the vertices are at most 
A: -b 1 as any vertex having weight strictly more than k can not be a part of any 
minimal feedback vertex set of weight at most k. So given the weight function if 
some vertex v has tt{v) > k then we make tt{v) = fc -b 1. It is easy to see that T 
has feedback vertex set of weight at most k if and only if it has feedback vertex 
set of weight at most k with the modified weight function. 

We will construct a new tournament T' from T as follows: replace each vertex 
V having weight tt(v) = w > 1 with a cluster w' consisting of w vertices. If there 
is an arc {u, v) in the original tournament T then we add an arc from every 
vertex of the cluster u' to every vertex in v' . Now we add intra-cluster arcs so 
that each cluster is transitive. 



u V 




Fig. 2. A Witness Cycle 



Let {v\,V 2 , • • • , w/} be the feedback vertex set {FV S) of weight at most k in 
T. Then the vertices of the corresponding clusters {vi,V 2 , - ■ ■ ,v'i \ form a FV S of 
size at most k in T' . The other direction follows from the observation that every 
minimal feedback vertex set (F) of size at most k in T' is such that either all 
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the vertices of any cluster are there or none of them is. To see this, assume that 
there is a cluster w' such that there is a vertex v € v' in F and a u G v' not in F. 
Since v G F, there exist a witness cycle C such that F fiC = {f }. Now iiu ^ C 
then we get a cycle C in T' by replacing v with u in C such that C F = % 
contradicting the definition of F. Otherwise if u is part of this cycle then the 
length of the cycle is at least 4 and we can obtain a smaller cycle C containing 
u such that C' n T’ = 0 as shown in Figure 2. Here C is {u, ■ ■ ■ ,v,w, - ■ ■ ,x} and 
we can take C as {u, w, • • • , a;}. So this proves the observation. 

Number of vertices in the new instance of the graph is bounded by (fc + l)n, 
and hence by applying the hitting set algorithm on this instance, we solve the 
problem in time O((2.270)^ + (fcn)^). 

□ 



3.2 Real- and General-WFVS 

If the weights are arbitrary reals, but at least 1, then the algorithm for un- 
weighted tournament cannot be directly applied. Here we give an algorithm 
which attains 0((2.4143)^n“) bound. 

Algorithm TFVS(T,fc, 7T, F){*T is a tournament, fc > 0, tt is a weight function 
on V, F is a set of vertices*) 

(Returns a minimal feedback vertex set of weight at most k, if exists and returns 
‘NO’ otherwise. F is a local variable containing vertices of a partial feedback 
vertex set that are deleted from the original T. Initially the algorithm is called 
by TFVS(r,/c,0)) 

Step 0: If T does not have a directed triangle and fc > 0, then answer YES 
and return F. 

Step 1: If /c = 0 and T has a triangle, then answer NO and EXIT. 

Step 2 : Find an induced subgraph on 4 vertices isomorphic to Fi (as in figure 
I) with vertex set, say {1, 2, 3, 4} and the adjacencies be as in figure I. If no 
such subgraph exists, then go to Step 4. 

Step 3 : If any of the following recursive calls result in true, then return YES 
and exit with the corresponding F, and return NO and exit otherwise. In 
the following, T' is obtained by deleting the ‘newly included’ vertices in F. 

1. TFFS'(T',A:-7r(3),7r,FU{3}), 

2. TFFS'(T',A:-7r(4),7r,FU{4}), 

3. TFVS{T', k - 7t(1) - 7t(2), tt, F U {1, 2}) 

Step 4: Solve the minimum weighted feedback vertex problem in polynomial 
time for the resultant tournament using Lemma 4. 

Correctness of Step 0 and 1 follow from Lemma 2. In Step 3, we branch 
on all possible minimal solutions of F\. Step 4 follows from Lemma 4 with the 
modification that in each of the triangles we pick the minimum weight vertex. 
The time taken by the algorithm is bounded by the following recurrence: 

T(n, k) < 2T{n - 1, k - 1) + T{n - 2, k - 2) + 0(n‘^) 

which solves to 0((2.4143)^n‘^). So we have following theorem: 
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Theorem 6. Given a tournament T = (V,E), and a weight function \ V ^ 
such that n{v) is at least 1 for every v & V , then we can determine whether 
T has a feedback vertex set of weight at most k in 0((2.4143)^n“) time. 

General- WFVS problem, where weights can be arbitrary positive reals, is not 
fixed parameter tractable unless P = NP. We show this by proving that it is 
A^P-complete for some fixed constant k (in fact, for fc = 1). Our reduction is from 
the fVP-complete unweighted feedback vertex set problem [12] in tournaments 
by simply defining the weight function tt to be tt(v) = Ifk for all v € V. Then 
the original tournament has a FV S of size k if and only if the resulting weighted 
tournament has a FV S of weight 1. This implies that there cannot be a f{k)n^^^^ 
or even time algorithm for General- WFVS problem unless P = NP. 

Theorem 7. General- WFVS problem is not fixed parameter tractable unless 
P = NP. 



4 Conclusions 

In this paper, we have obtained improved algorithms for parameterized feedback 
arc and vertex set problem on weighted tournaments. We used a characterization 
for minimal feedback arc set which maintains the tournament structure at every 
recursive step. For the feedback arc set problem, the complexity of the algorithms 
in unweighted and weighted (with weights at least 1) are the same while this is 
not the case for the feedback vertex set problem. 

It would be interesting to see whether the unweighted feedback vertex set 
problem on tournaments has some special structure that can be utilized to de- 
velop an algorithm better than that of the 3-hitting set problem. Similarly it 
would be interesting to see whether the best known algorithm for the 3-hitting 
set problem can be used for the feedback arc set problem in unweighted tourna- 
ments to achieve the same time bound. 
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Abstract. The two objectives of this paper are: (1) to articulate three 
new general techniques for designing FPT algorithms, and (2) to apply 
these to obtain new FPT algorithms for Set Splitting and Vertex 
Cover. In the case of Set Splitting, we improve the best previous 
Cl*(72^) FPT algorithm due to Dehne, Fellows and Rosamond [DFR03], 
to by an approach based on greedy loealization in conjunction 

with modeled crown reduction. In the case of Vertex Cover, we de- 
scribe a new approach to 2k kernelization based on iterative compression 
and crown reduction, providing a potentially useful alternative to the 
Nemhauser-Trotter 2k kernelization. 



1 Introduction 

This paper has a dual focus on: (1) the exposition of some new general FPT 
algorithm design techniques, and (2) the description of two concrete applications 
of these techniques to the Vertex Cover and Set Splitting problems. The 
latter is defined: 

Set Splitting 

Instance: A collection T of subsets of a finite set X, 

and a positive integer k. 

Parameter, k 

Question: Is there a subfamily T' C IF 

and a partition of X into disjoint subsets Aq and Xi 
such that for every S G T' , S H Xq ^ 0 and S' n Ai yf 0, 
with \T'\ > kl 

The Set Splitting problem is NP-complete [GJ79] and APX-complete 
[Pe94]. Andersson and Engebretsen [AE97], and Zhang and Ling [ZLOl] pre- 
sented approximation algorithms that provide solutions within a factor of 0.7240 
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and 0.7499, respectively. A 1/2 approximation algorithm for the version of the 
Set Splitting problem where the size of Xq is specified has been described by 
Ageev and Sviridenko [ASOO]. 

It is a straightforward exercise to show that Set Splitting is fixed param- 
eter tractable by the method of color-coding [AYZ95] . One of the techniques we 
will discuss, which we refer to here as greedy localization was first used by Chen, 
Friesen, Jia and Kanj [CFJKOl] (see also Jia, Zhang and Chen [JZC03]). This 
approach can potentially be employed in designing FPT algorithms for many 
different maximization problems. In the case of Set Splitting we present an 
example of the deployment of this approach that yields a significant improve- 
ment over the best previous 0*{72^) FPT algorithm for this problem, due to 
Dehne, Fellows and Rosamond [DFR03]. Here we combine this technique with 
crown reduction (where the reduction rule is guided by a crown decomposition 
of an associated graph that models the situation) and obtain an 0*{S^) FPT 
algorithm for Set Splitting. 

The method of iterative compression could be viewed as in some sense “dual” 
to greedy localization, since it seems to be potentially applicable to a wide range 
of minimization problems. Both of these techniques are in the way of “opening 
moves” that can be used to develop some initial structure to work with. Neither is 
technically deep, but still they can be pointed to as potentially of broad utility in 
FPT algorithm design. A simple application of iterative compression to Vertex 
Cover yields a new 2k Turing kernelization that may offer practical advantages 
over the Nemhauser- Trotter 2k many:l kernelization algorithm. 

We assume that the reader has a basic familiarity with the fundamental con- 
cepts and techniques in the FPT toolkit, as exposited in [DF99, Nie02] (and also 
with the definition of basic combinatorial problems such as Vertex Cover). We 
also assume that the reader is familiar with the research program in “worst-case 
exponential complexity” articulated in the survey paper by Woeginger [Woe03] . 
In particular, we employ the handy O* notation introduced there for FPT re- 
sults, that suppresses the polynomial time contribution of the overall input size 
and focuses attention on the exponential time-complexity contribution of the 
declared parameter. An FPT algorithm that runs in time 0*{8^) thus runs in 
time 0{8^n^) for some constant c independent of the parameter k. 



2 The New Techniques 

There are three new FPT design techniques to which we wish to draw attention: 

• greedy localization 

• iterative compression 

• modeled crown reductions 

The first two are only relatively simple opening moves, but nevertheless these 
deserve wider recognition in the context of FPT algorithm design. 
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2.1 Greedy Localization 

This is an approach that can often be applied to maximization problems. The 
idea is to start off with an attempted greedy solution. For example, in the case 
of the Set Packing algorithm due to Jia, Zhang and Chen, the first step is to 
greedily compute a maximal collection of pairwise disjoint sets. If k are found, 
then of course we are done. Otherwise, we can make the observation that if 
there is any solution (fc pairwise disjoint sets) then every set in the solution 
must intersect our (small) maximal collection. Thus we have gained some ini- 
tial information that narrows our search, “localizes” our efforts to this initial 
structure. 

As Set Splitting is a maximization problem, we will similarly employ here 
an opening move that attempts to find a greedy solution, which similarly either 
succeeds and we are done, or provides us with some initial structure to work with. 
Greedy localization has been employed in a few other recent FPT algorithms 
[FHRST04, PS04, MPS04, FKN04]. 

2.2 Iterative Compression 

This “opening move” to develop initial structure seems first to have been used in 
an FPT algorithm recently described by Reed, Smith and Vetta for the problem 
of determining for a graph G whether k vertices can be deleted to obtain a 
bipartite graph G' (an important breakthrough as this problem has been open 
for some time) [RSV03]. Their approach can be briefly described as follows. 

First, the problem is respecified constructively: we aim for an FPT algorithm 
that either outputs NO, or constructively produces the set of k vertices whose 
removal will make the graph bipartite. 

Second, we attempt a recursive solution (which we will see has a simple 
iterative interpretation). Choose a vertex v, and call the algorithm on G — v. 
This either returns NO, and we can therefore return NO for G, or it returns a 
solution set of size k. By adding v to this set, we obtain a solution of size A: -I- 1 
for G, and what remains to be done is to address the following (constructive) 
compression form of the problem: 

Input: G and a solution S of size fc -|- 1 

Output: Either NO, or a solution of size k, if one exists. 

The iterative interpretation is that we are building the graph up, vertex by 
vertex, and at each step we have a small solution (of size fc -|- 1) and attempt to 
compress it. This interpretation makes clear that our overall running time will 
be 0{n ■ f{n,k)) where /(n, fc) is the running time of our FPT algorithm for 
the compression form of the problem. Of course, all the real work lies there, but 
this overall algorithmic approach, simple as it is, gives us some initial structure 
to work with. The approach is clearly of potential utility for many different 
minimization problems. (For another recent application see [Ma04].) 
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2.3 Modeled Crown Reductions 

Both of our concrete applications, to the Set Splitting and to the Vertex 
Cover problems, also use the recently developed techniques of crown decompo- 
sitions and crown reduction rules. This technique was first introduced by Chor, 
Fellows and Juedes [CFJ04] (a brief exposition can also be found in the re- 
cent survey [F03]). In [CFJ04] the technique is applied to the problems Graph 
Coloring with (n — k) Colors and to Vertex Cover. Crown reduction 
has turned out to be effective for a surprisingly wide range of parameterized 
problems; see also [PS03, FHRST04, PS04, MPS04]. Here we show that crown 
reductions can even be employed on problems that are not about graphs. Our 
0*{8^) FPT algorithm for Set Splitting employs a kernelization rule that is 
based on a crown decomposition in an associated auxiliary graph that models 
some of the combinatorics of the Set Splitting problem. 

The machinery from [CFJ04] that we employ here is next described. 

Definition 1. A crown decomposition of a graph G = (V,E) is a partition of 
the vertices ofG into three disjoint sets H, G and J with the following properties: 

1. G is an independent set in G. 

2. H separates C from J , that is, there are no edges between G and J . 

3. H is matched into C, that is, there is an injective assignment m : H ^ G 
such that V/i G H , h is adjacent to m{h). 

The Crown Rule for Vertex Cover transforms (G,k) into {G',k'), where 
G' = G — C — H, and k' = k — \H\. The Crown Rule for the Graph Coloring 
WITH {n — k) Colors problem is (surprisingly) the same rule applied to G 
[CFJ04]. 

We will use the following lemma from [CFJ04]. 

Lemma 1. If a graph G = (V,E) has an independent set I C V{G) such that 
|fV(/)| < |/| then a nontrivial crown decomposition {C,H,J) with C C I for G 
can he found in time C(|P| -I- |L^|). 

3 An 0 *{ 8 ^) FTP Algorithm for Set Splitting 

The input to the Set Splitting problem consists of a family iF C 2^ of subsets 
of a base set X, and a positive integer k. We can trivially assume that every set 
S G E consists of at least two elements of X. 

The first step of our algorithm is a greedy computation of what we will term 
a witness structure for the instance. The witness structure consists of a collection 
of sets F' C F , and for each of the sets Si G F' a, choice of two distinct elements 
bi G Si and Wi G Si. It is allowed that these chosen elements may coincide, that 
is, for Si yf Sj possibly bi = bj or Wi = Wj (or both). What is also required of the 
witness structure is that the sets B = {5i, 62 , ^r} of black witness elements, 

and W = {wi,W 2 , ■..,Wr} of white witness elements are disjoint. It is clear that 
if we succeed in greedily computing a witness structure with \F'\ = r, then any 
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extension of the disjoint subsets B and VL of X to a bipartition of X will split 
the r sets in T' . 

The first step (greedy localization) is to compute a maximal witness structure 
by the obvious greedy algorithm of repeatedly adding sets to T' so long as this 
is possible. If r > A: then we are done. 

At the end of the greedy step, if we are not done, then the following structural 
claims hold. 

Claim 1. Every set S not in the maximal witness structure collection T' consists 
entirely of black or entirely of white elements, that is, either S' C B or S' C IT. 

Proof. We have assumed that every set contains at least two elements. Consider 
a set S G !F that is not in the maximal witness structure family IF'. If S C BUW, 
then clearly either S C B or S C W else S is split and could be added to IF'. 
Hence suppose that there is an element x G S, where x ^ HU IT. If S contains an 
element of B (or IT) then x could be assigned to IT (or H) and T' augmented by 
S, contradicting our assumption that the witness structure is maximal. Since S 
has at least two elements, the only remaining case is that S contains two distinct 
elements that do not belong to H U IT. But then, one could be assigned to B 
and one to IT and T' could again be augmented, contradicting our assumption. 

The following claim is obvious (but crucial): 

Claim 2. |H| < A: — 1 and |IT| < A: — 1. 

Our algorithm is described as follows: 

Step (1): Greedily compute a maximal witness structure. If k sets are split, 
then report YES and STOP. (If not, then \T'\ < A; — 1.) 

Step (2): Branch on all ways of “recoloring” the (at most) 2{k — 1) elements 
that were colored (placed in either B or IT) in the witness structure. 

Subproblem 

For each recoloring (bipartition) of H U IT into B' and IT' 

Step (3): Determine the number of sets that have been split. If k sets are split 
then report YES and STOP. 

Otherwise 

Step (4): Generate an auxiliary graph G describing the sets that remain unsplit 
and the elements of A — (H U IT) contained in them. 

Step (5): Repeatedly apply the Grown Reduction Rule (described below) to 
the subproblem represented by this graph until a subproblem kernel consisting 
of at most {k — 1) elements not in B' U IT' remains. 

Step (6): After we have exhausted the ability to reduce the subproblem in- 
stance using the Grown Reduction Rule, there can be at most A: — 1 vertices still 
remaining to be assigned a color. Try all 2^“^ ways to color these elements. 
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3.1 The Subproblem 

After re-coloring (partitioning) B \JW into B' and W , some number t of sets 
in T will have been split (that is, have nonempty intersection with both B' 
and W). li t > k then of course we will be done (Step 3). Let Q denote the 
subfamily of T that is not split by B' and W . The subproblem is whether the 
disjoint sets B' and W can be extended to a bipartition of X that splits k sets. 
In other words, the subproblem is to determine if the remaining (yet uncolored) 
elements, those in A — {B' U W) can be colored (black and white, extending B' 
and W) in such a way that at least k' = k — t further sets in Q are split. Note 
that the fate of any set that is a subset of B U IT is completely determined by 
the recoloring into B' and IT': it is either among those split, or no extension 
can split it. Thus in the subproblem, we can restrict our attention (by Claim 
1) to the sets in Q' = Q — C T' . That is, the only candidates for further 

splitting belong to our greedy collection T' (!) and there are at most A: — 1 of 
these. We can therefore observe the following claims concerning the subproblem: 



Claim 3. Every set in Q' contains either two distinct elements of B' (denote 
these sets B) or two distinct elements of IT' (denote these sets W). Furthermore, 
every set in Q' contains at least one element of A — (B' U IT'). 

Claim 4. \B U Wl < fc — 1. 

3.2 Crown Reduction for the Subproblem 

The subproblem is modeled by a bipartite graph with vertex sets Tg U Tw, and 
Vu- The vertices vs of Vb U Tyy correspond, respectively, to the unsplit sets S 
in B and W. The vertices Ux of Vu correspond to the uncolored elements in 
U = X — {B' U IT'). There is an edge between vs and Ux if and only if a; G S'. 
See Figure 2. 




• Un-Split B Sets 
Q Un-Split Vy Sets 
o Uncolored elements 



Fig. 1. Auxiliary Graph 



The graph model of the subproblem may now be used to locate a crown 
decomposition that can be used to kernelize the subproblem instance. 
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By Lemma 1, ii \IA\ > k then we can efficiently compute a nontrivial crown 
decomposition (C,H,J) with C C Vu. Interpreting what this means for the 
subproblem instance, we have identified a nonempty subset H of the unsplit sets 
in ,8 U W (the head) that is matched into a subset C of the uncolored elements 
U, the crown. Furthermore, by the properties of a crown decomposition, the 
elements of C do not belong to any other unsplit sets in BUW. 

We can kernelize the subproblem instance according to the following rule: 

Crown Reduction Rule: In the situation described above, we can reduce the 
subproblem instance by using the matched elements in C to split the sets in Ti, 
augmenting B' and W' accordingly. Thus the reduced subproblem instance is 
modeled by the graph obtained by deleting the vertices that correspond to C 
and Ti. and recalculating k' . 




0 Un-Split B — Sets 
Q Un-Split W — Sets 
, Uncolored Elements 



Fig. 2. Crown Decomposition 



Lemma 2. The crown rule can he used to reduce the number of elements not 
assigned a color to be less than k — 1 in polynomial time. 

Proof. Lemma 1 states that if a graph G = (V,E) has an independent set I C 
V{G) such that |fV(d)| < |/| then G admits a nontrivial crown decomposition 
where the crown set is a subset of I . As long as the number of elements not 
assigned a color is greater than |W U 8| we can find a crown in polynomial time. 
Thus as |WU8| < k — 1 ,by continually applying the crown rule we can reduce 
the number of elements that still need to be assigned a color to A: — 1. 

3.3 Complexity Analysis of the Algorithm 

Theorem 1 The Set Splitting problem for parameter k can be solved in 
0*(8^“i) time. 

Proof. Finding a maximal witness structure can be performed in 0{n) time. 
By Lemma 3, \B U W\ < 2{k — 1). The algorithm will branch into at most 4^“^ 
subproblems. Each branch is a completely contained subproblem with the partial 
splitting of the base set depending on which branch we are on. The Crown Rule 
kernelization results in a subproblem kernel having at most k — 1 uncolored 
elements. Thus there are at most colorings of these to be explored. With at 
most 4^“^ subproblems, each yielding (after kernelization) at most 2 ^~^ branches 
to explore, we get an 0*(8^“^) FPT algorithm. 
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4 A New 2k Kernelization for Vertex Cover 

If we apply the iterative compression technique to the Vertex Cover prob- 
lem, then we are concerned with the following solution compression form of the 
problem, which is specified constructively, rather than as a decision problem: 

Input: A graph G = (V, E) and a (fc -|- l)-element vertex cover V C V. 
Parameter: k 

Output: Either a fc-element vertex cover, or NO if none exists. 

Lemma 1 guarantees a nontrivial crown decomposition if the number of ver- 
tices in V — V exceeds fc -I- 1. Thus we immediately obtain a problem kernel 
having at most 2(fc -|- 1) vertices. This improves the 3fc kernel based on crown 
reduction described in [CFJ04]. 

Note. The astute reader will note that this is not a “kernel” in the usual sense of 
the word (which is generally taken in the sense of many:l polynomial-time reduc- 
tions). Here the 2k kernel that we achieve is actually a series of n kernels, which 
can be formalized as a Turing form of parameterized problem kernelization. 

5 Conclusions and Open Problems 

We have described some new approaches in the design of FPT algorithms, and 
have applied these to two concrete problems. The more substantial of these appli- 
cations is an 0*{S^) FPT algorithm for Set Splitting, significantly improving 
the best previous 0*{12^) algorithm. While our contribution in the case of Ver- 
tex Cover is really little more than a small observation, it is still somewhat 
surprising that after so much attention to this problem there is anything new to 
be said about it. Whether 2fc kernelization via iterative compression and crown 
reduction has any practical advantages over Nemhauser- Trotter kernelization is 
an interesting question for further research along the lines of [ACFLSS04] , where 
it is demonstrated that crown reduction is indeed useful in a practical sense. In 
general, it seems that the articulation of the basic toolkit for FPT algorithm 
design is still, surprisingly, in its infancy. 

Acknowledgement. We thank Daniel Marx for helpful discussions and sug- 
gestions, particularly about the iterative compression technique, for which he 
suggested the name. 
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Abstract. We discuss open questions around worst case time and space 
bounds for NP-hard problems. We are interested in exponential time 
solutions for these problems with a relatively good worst case behavior. 



1 Introduction 

Every problem in NP can be solved in exponential time by exhaustive search: 
Recall that a decision problem is in NP, if and only if there exists a polynomial 
time decidable relation R{x, y) and a polynomial m{\x\) such that for every YES- 
instance a;, there exists a YES-certificate y with |y| < m{x) and R{x, y). A trivial 
exact algorithm for solving instance x enumerates all possible strings y with 
lengths up to m(|a;|), and checks whether any of them yields a YES-certificate. 
Up to polynomial factors that depend on the evaluation time of R{x,y), this 
yields an exponential running time of . 

A natural question is: Can we do better than this trivial enumerative al- 
gorithm? Interestingly, for many combinatorial optimization problems the an- 
swer is YES. Early examples include an 0*( 1.4422”) algorithm for deciding 
3-colorability of an n- vertex graph by Lawler [21]; an O* (1.2599") algorithm 
for finding a maximum independent set in an n- vertex graph by Tarjan & Tro- 
janowski [24]; an O* (1.4142”) algorithm for the SUBSET-SUM problems with 
n integers by Horowitz & Sahni [18]. (The notation 0*{f{n)) is explained at the 
end of this section.) Woeginger [26] surveys many results in this area. 

For some optimization problems, we can reach an improved time complexity, 
but it seems that we have to pay for this with an exponential space complexity. 
Note that algorithms with exponential space complexities are absolutely useless 
for real life applications. In this paper, we discuss a number of results around fast 
exponential time algorithms that come with exponential space complexities. We 
present approaches, tricks, related polynomially solvable problems, and related 
open questions. 

Notation. Throughout this paper, we will use a modified big-Oh notation that 
suppresses polynomially bounded terms. For a positive real constant c, we write 
0*(c") for a time complexity of the form 0(c" • poly(n)). The notations I7*(c") 
and 6>*(c”) are defined analogously. 
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2 Integers and Their Sums 

We start this section with a couple of polynomially solvable problems: An input 
to the first problem “fc-SUM” consists of m integers oi , . . . , a™ and a goal sum S. 
The problem is to decide whether there are k of these integers that add up to S'. 
An input to the second problem “Table-fc-SUM” consists of a k x m table and a 
goal sum S; the entries in row i of the table are denoted by , Ri{m). The 

problem is to decide whether one can choose k integers from this table, exactly 
one from each row, that add up to S. In both problems, the number A: is a fixed 
integer that is not part of the input. Both problems are closely related, and they 
can be reduced to each other in linear time (Erickson [12]). Both problems are 
trivially solvable in polynomial time 0{m^). 

Here is how to get a better time complexity for Table- 2-SUM: Sort the entries 
in the first row. Then for j = 1, . . . ,m perform a binary search for the value 
S — R 2 {j) in this sorted first row. If the search succeeds at Ri{i), then Ri{i) = 
S — R 2 U) and the answer is YES. If all searches fail, then the answer is NO. 

Fact. Table-2-SUM can be solved in 0{mlogm) time and 0{m) space. 

The same approach also yields fast algorithms for Table-A:-SUM for all fc > 3: 
Compute the sum of every |"A:/2] -tuple of integers that has one entry in each of 
the first \k/2\ rows; these sums form the first row in a new table. Compute 
the sum of every [fc/2j -tuple of integers that has one entry in each of the last 
[fc/2j rows; these sums form the second row in the new table. Apply the above 
algorithm to this new instance of Table- 2-SUM. 

Fact. Table-k-SUM can be solved in logm) time and space. 

For odd k, the time complexity can be slightly improved to see 

for instance Erickson [11]. In particular, the 3-SUM problem can be solved in 
0{m?) time. We will not go into details, since in this paper we really do not care 
about logarithmic factors. The main drawback of all these algorithms is their 
horrible space complexity. 

Schroeppel & Shamir [23] improve the space complexity for Table-4-SUM 
by using a data structure that enumerates the sums Ri{i) + R 2 U) with 
1 < b j < m in non-decreasing order. This data structure uses only 0{m) space. 
Every time we kick it, it starts working for O(logm) time steps, and then spits 
out the next larger sum Rx{i) + R 2 {j). The data structure is based on a balanced 
search tree that supports deletions, insertions, and extracting the minimum with 
logarithmic work per operation. It is built as follows: In a preprocessing step, we 
bring the entries in the second row into non-decreasing order. As a consequence, 
we have for every fixed index i that 

i?i (z) -|- (1) Y i?i (z) -|- i? 2 ( 2 ) Y ••• < i?i (z) -|- i? 2 (w) ■ 

For every index z (1 < z < zn), the data structure stores the pair (z, j) that 
corresponds to the first unvisited sum R\{i) + R 2 U) this ordering. Whenever 
the data structure is kicked, it extracts and deletes the pair (z, j) with minimum 
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sum, and inserts the pair {i,j + 1) instead. All in all, the enumeration of the 
sums costs 0{m? log m) time. 

Schroeppel & Shamir [23] use two such data structures; the first one generates 
the sums x = Ri{i) + i? 2 (j) in non-decreasing order, whereas the second one 
generates the sums y = Rs^s) + R 4 {t) in non-increasing order. Whenever x + y < 
S holds, the current value of x is too small for reaching the goal sum S; we 
replace it by the next larger sum Ri{i) + R 2 U) from the first data structure. 
Whenever x+y > S holds, the current value of y is too large for reaching the goal 
sum S; we replace it by the next smaller sum R^^s) + R 4 {t) from the second data 
structure. These steps are repeated over and over again, until one data structure 
becomes empty (answer NO) or until we reach x + y = S (answer YES). 

Fact. Table-i-SUM can be solved in 0(m^ logm) time and 0{m) space. 

Open problem 1 

(a) Is there an O(m^logm) time and 0{m) space algorithm for Table-6-SUM? 

(b) Is there an time algorithm for Table-k-SUM for some integer 

fc > 3 and some real a > 0 ? 

Now let us turn to negative results around the fc-SUM and the Table-fc-SUM 
problem. The 3-SUM problem plays a notorious role in computational geometry. 
Gajentaan & Overmars [15] have put together a long list of geometric problems: 
All problems on this list can be solved in quadratic time, and for all of them 
nobody knows how to do better. All problems on this list contain 3-SUM as 
a special case (under linear time reductions), and for all of them this 3-SUM 
special case (intuitively) seems to be the main obstacle for breaking through 
the quadratic time barrier. One example problem on this list is: Given m (pos- 
sibly overlapping) triangles in the Euclidean plane, compute the area of their 
union. Another one: Given m pairwise non-intersecting straight line segments in 
the Euclidean plane, is there a straight line that separates them into two non- 
empty subsets? And another one: Given m points in the Euclidean plane, are 
some three of them on a common line? For instance, the linear time reduction 
from 3-SUM to 3-POINTS-ON-A-GOMMON-LINE is based on the following 
observation: The x-coordinates of the intersection points of the line y = ax + b 
with the curve y = f{x) = x^ — Sx^ are the roots of x^ — Sx^ — ax — b = 0; 
for every line the sum of these roots equals S, the coefficient of the quadratic 
term. Gonsequently, the point set (ai, /(oi)), ( 02 , 7 ( 02 )), . . . , (a™, /(um)) con- 
tains three points {ax, f{ax)), {ay,f{ay)), {az,f{az)) on a common line, if and 
only if Qx + tty + az = S. The bottom-line of this paragraph is that research on 
the 3-SUM problem is severely stuck at the threshold 0{mf). 

What about the general /c-SUM problem with A: > 4? Here we are stuck 
around the threshold 0{m '^^/'^^ ). Erickson [11] proved an ) lower bound 

on fc-SUM in a certain restricted variant of the linear decision tree model. The 
additional restriction in his model is that every decision step must be based on 
testing the sign of some affine linear combination of at most k elements of the 
input. At first sight, this model seems to be strange, and the lower bound result 
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seems to be quite weak. However, given our general failure in proving reasonable 
lower bounds for algorithmic problems and given the lack of tools in this area, 
Erickson’s lower bound result in fact is a major breakthrough. 

Open problem 2 Prove a non-trivial lower hound for the k-SUM problem in 
the algebraic decision tree model or in the algebraic computation tree model (see 
Ben- Or [4])- 

Downey & Fellows [7,8] have proved that the fc-SUM problem with parameter 
k is W[l]-hard. All these negative results for fc-SUM translate into analogous 
negative results for Table-fc-SUM. 

After this long polynomial time prelude, we will spend the rest of this sec- 
tion on NP-hard problems. In the NP-hard SUBSET-SUM problem, the input 
consists of n positive integers 6 i, . . . , and a goal sum B. The problem is to 
decide whether there exists some subset of the bi that add up to H. The strongest 
known negative result for SUBSET-SUM is an Q{n^) lower bound in the alge- 
braic computation tree model of computation [6,4]. 

On the positive side, Horowitz & Sahni [18] have come up with the following 
approach for SUBSET-SUM: They split the instance into two parts, one part 
with 6 i, . . . , &[n/2j and another part with 6[n/2j-i-ij ■ ■ ■ ,bn- They construct a table 
with two rows, where the first row consists of all the subset sums for the first part, 
and where the second row consists of all the subset sums for the second part. 
The table can be computed in time. The SUBSET-SUM instance has 

answer YES, if and only if the constructed Table- 2-SUM instance with S = B has 
answer YES. Our above discussion of Table- 2-SUM yields the following result. 

Fact. SUBSET-SUM can he solved in time and in 0*(2”/^) space. 

Schroeppel & Shamir [23] follow essentially the same idea, but instead of 
splitting the SUBSET-SUM instance into two parts, they split it into four parts 
of size approximately n/4. This leads to a corresponding instance of Table-4- 
SUM, and to a substantially improved space complexity. 

Fact. SUBSET-SUM can he solved in 0*(2"/^) time and in 0*(2”/^) space. 

Generally, if we split the SUBSET-SUM instance into k > 2 parts, then we 
get a corresponding table with k rows and 0[f2^!^') elements per row. Apply- 
ing the fastest known algorithm to the corresponding instance of Table-fc-SUM 
gives a time complexity of with f(n, k) = n \k/2~\ / k> n/2. Hence, 

this approach will not easily lead to an improvement over the time complexity 
0*(2”/^). Schroeppel & Shamir [23] also construct t{n) time and s{n) space al- 
gorithms for SUBSET-SUM for all s(n) and t(n) with < t(n) < 0*(2”) 

and s^(n) ■ t(n) = 6>*(2”). 

Open problem 3 

(a) Construct an 0*(1.4") time algorithm for SUBSET-SUM. 

(b) Construct an 0*(1.99”) time and polynomial space algorithm for SUBSET- 
SUM. 
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(c) We have seen that positive results for Table-k-SUM yield positive results 

for SUBSET-SUM. Can we establish some reverse statement? Do fast (ex- 
ponential time) algorithms for SUBSET-SUM yield fast (polynomial time) 

algorithms for Table-k-SUM? 

Another NP-hard problem in this area is the EQUAL-SUBSET-SUM problem: 
Given n positive integers 6i, do there exist two disjoint non-empty subsets 

of the bi that both have the same sum. A translation of EQUAL-SUBSET- 
SUM into a corresponding Table-4-SUM instance leads to an 0*(2”) algorithm 
for EQUAL-SUBSET-SUM. It might be interesting to design faster algorithms 
for EQUAL-SUBSET-SUM, and to get some understanding of the relationship 
between fast algorithms for SUBSET-SUM and fast algorithms for EQUAL- 
SUBSET-SUM. 

3 Graphs and Their Cliques and Cuts 

We start this section with the polynomially solvable fc-CLIQUE problem: An 
input consists of an undirected, simple, loopless p- vertex graph G = (V, E). The 
problem is to decide whether G contains a clique on k vertices. We stress that k 
is not part of the input. The fc-CLIQUE problem is easily solved in polynomial 
time 0{p^). 

Itai & Rodeh [19] observed that fast matrix multiplication can be used to 
improve this time complexity for 3-CLIQUE: Recall that the product of two 
p X p matrices can be computed in 0{p^) time, where oj < 2.376 denotes the so- 
called matrix multiplication exponent; see Coppersmith & Winograd [5]. Recall 
that in the £th power of the adjacency matrix A of graph G, the entry at the 
intersection of row i and column j counts the number of walks with l-\-\ vertices 
in G that start in vertex i and end in vertex j. Furthermore, a 3-clique {x, y, z) 
yields a walk x — y — z — x with four vertices from x to x, and vice versa, every 
walk with four vertices from vertex a; to a; corresponds to a 3-clique. Hence, G 
contains a 3-clique if and only if A^ has a non-zero entry on its main-diagonal. 

Fact. The 5-CLIQUE problem for a p-vertex graph can be solved in 0{p'^) time 
(where uj < 2.376 is the matrix multiplication exponent) and in 0{p^) space. 

Nesetfil & Poljak [22] extend this idea to the 3fc-CLIQUE problem: For every 
/c-clique C in G, create a corresponding vertex v{G) in an auxiliary graph. Two 
vertices v{G\) and v{G 2 ) are connected by an edge in the auxiliary graph, if and 
only if Cl U C 2 forms a 2/c-clique in G. Note that the auxiliary graph has 0{p^) 
vertices. Furthermore, graph G contains a 3/c-clique if and only if the auxiliary 
graph contains a 3-clique. 

Fact. The 5k-GLIQUE problem for a p-vertex graph can be solved in 0(p'^^) 
time and 0{p^^) space. 

This approach yields a time complexity of for {3k -\- 1)-CLIQUE, 

and a time complexity of 0(p“ for (3/c-|-2)-CLIQUE. Eisenbrand & Grandoni 
[9] slightly improve on these bounds for {3k -\- 2)-CLIQUE (with k > 2) and for 
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(3A: + 1)-CLIQUE (with 1 < fc < 5). In particular, for 4-CLIQUE [9] gives a time 
complexity of 

Open problem 4 

(a) Design algorithms with better time and/or space complexities for k-CLIQUE! 

(b) Is there an time algorithm for 10-CLIQUE? 

(c) Is 3-CLIQUE as difficult as Boolean matrix multiplication? 

On the negative side, we have that the fc-CLIQUE problem with parameter k 
is W[l]-hard (Downey & Fellows [7,8]). For the variant where k is part of the 
input and k « logn, Feige & Kilian [14] show that a polynomial time algorithm 
highly unlikely to exist. 

Now let us turn to NP-hard problems. In the MAX-CUT problem, the in- 
put consists of an n- vertex graph G = {V,E). The problem is to find a cut of 
maximum cardinality, that is, a subset X C V of the vertices that maximizes 
the number of edges between X and V — X. The MAX-CUT problem can be 
solved easily in 0*(2”) time by enumerating all possible certificates X. Fedin 
& Kulikov [13] present an 0*(2l^l/^) time algorithm for MAX-CUT; however, 
it seems a little bit strange to measure the time complexity for this problem in 
terms of jF j and not in terms of n = jUj. 

Williams [25] developed the following beautiful approach for MAX-CUT: 
We partition the vertex set V into three parts Vo, Vi, V 2 that are of roughly 
equal cardinality n/3. We introduce a complete tri-partite auxiliary graph that 
contains one vertex for every subset Xq C Vq, one vertex for every subset Xi C 
Ui, and one vertex for every subset X 2 C V 2 - For every subset Xi C V and 
every Xj C Vj with j = i-\-l (mod 3), we introduce the directed edge from Xi to 
Xj. This edge receives a weight w{Xi, Xj) that equals the number of edges in G 
between Xi and U — Xi plus the number of edges between Xi and Vj — Xj plus 
the number of edges between Xj and Vi — Xi. Note that for C U (f = 0, 1, 2) 
the cut Ao U Ai U X 2 cuts exactly w{Xq, XQ -\- w(Ai, A 2 ) -I- w{X 2 , Aq) edges in 
G. Consequently, the following three statements are equivalent: 

~ The graph G contains a cut with z edges. 

— The auxiliary graph contains a 3-clique with total edge weight z. 

— There exist non-negative integers zqi, Z12, Z20 with zqi -I- Z12 + 220 = z , such 
that the auxiliary graph contains a 3-clique on three vertices Xi C Vi (i = 
0,1,2) with w(Ao,Ai) = Zqi and w(Ai,A 2 ) = Z 12 and w(A 2 ,Aq) = Z 20 . 

The condition in the third statement is easy to check: There are 0(jif j^) possible 
triples (zoi, Z 12 , Z 20 ) to consider. For each such triple, we compute a correspond- 
ing simplified version of the auxiliary graph that only contains the edges of 
weight Zij between vertices Aj C Vi and Xj C Vj. Then everything boils down 
to finding a 3-clique in the simplified auxiliary graph on 0(2"/^) vertices. 

Fact. The MAX-CUT problem can be solved in 0*(2‘^”/^) time and 0*(2“”/^) 
space. Note that 2“”/^ < 1.732”. 
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Of course, William’s algorithm could also be built around a partition of 
the vertex set V into four parts of roughly equal cardinality n/4, or around a 
partition of the vertex set V into k parts of roughly equal cardinality n/k. The 
problem then boils down to finding a /c-clique in some simplified auxiliary graph 
on 0(2”/^) vertices. With the currently known ^-CLIQUE algorithms, this will 
not give us an improved time complexity. 

Open problem 5 

(a) Design a faster exact algorithm for MAX-CUT. 

(b) Construct an 0*(1.99”) time and polynomial space algorithm for MAX- 
CUT. 

An input of the NP-hard BISECTION problem consists of an n- vertex graph G = 
(V,E). The problem is to find a subset X CV with |X| = n/2 that minimizes 
the number of edges between X and V — X. The approach of Williamson yields 
an 0*(2“”/^) time algorithm for BISECTION. Can you do better? 

4 Sets and Their Subsets 

There is a number of exact algorithms in the literature that attack an NP- 
hard problem by running through all the subsets of an underlying n-element 
ground set, while generating and storing useful auxiliary information. Since an 
n-element ground set has 2" subsets, the time complexities of these approaches 
are typically 17* (2”). And also the space complexities of these approaches are 
typically 17* (2”), since they store and remember auxiliary information for every 
subset. 

A good example for this approach is the famous dynamic programming al- 
gorithm of Held & Karp [17] for the travelling salesman problem (TSP): A trav- 
elling salesman has to visit the cities 1 to n. He starts in city 1, runs through 
the cities 2, 3, . . . , n — 1 in arbitrary order, and finally stops in city n. The dis- 
tance d{i,j) from city i to city j is specified as part of the input. The goal 
is to find a path that minimizes the total travel length of the salesman. The 
dynamic program of Held & Karp [17] introduces for every non-empty subset 
S' C {2, . . . , n — 1} of the cities and for every city i € S a corresponding state 
[S; i]. By Length[S; i] we denote the length of the shortest path that starts in 
city 1, then visits all cities in S — {i} in arbitrary order, and finally stops in 
city i. Clearly, Length[{z}; i] = d(l, i) holds for every z G {2, . . . , n — 1}. And 
for every S C {2, . . . , rz — 1} with jSj > 2 we have 

Length[S;z] = min{LENGTH[S - {z}; j] -I- d(j, z) : jGS-{z}}. 

By processing the subsets S in order of increasing cardinality, we can compute 
the value Length[S; z] in time proportional to jSj. In the end, the optimal travel 
length is given as the minimum min 2 <fe<n-i Length[{2, . . . , rz— 1}; k]-\-d{k, rz). 

Fact. The TSP can be solved in 0*(2") time and 0*(2”) space. 
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Open problem 6 

(a) Construct an exact algorithm for the n-city TSP with 0*(1.99”) time com- 
plexity! 

(b) Construct an exact algorithm for the n-city TSP with 0*(2") time complexity 
and polynomial space complexity! 

In the Hamiltonian path problem, we have to decide for a given graph G = 
(V, E) with vertices whether it contains a spanning path starting in 

vertex 1 and ending in vertex n. The Hamiltonian path problem forms a simpler 
special case of the TSP. Karp [20] (and independently Bax [1]) provided a cute 
solution for the restriction of Problem 6.(b) to this Hamiltonian special case. We 
use the following definitions. A walk in a graph is a sequence ui, . . . , Wfe of vertices 
such that every pair of consecutive vertices is connected by an edge; vertices and 
edges may show up repeatedly in a walk. For a subset S' C P we denote by 
Walk(S) the set of all walks with n vertices in G that start in vertex 1, end in 
vertex n, and avoid all the vertices in S. Let A be the adjacency matrix of G — S. 
Recall that in the kth power of A, the entry at the intersection of row i and 
column j counts the number of walks with k -\- 1 vertices in G — S that start in 
vertex i and end in vertex j. Therefore, the number of walks in Walk(S) can 
be read from matrix 

Fact. For every subset S CV, the cardinality |Walk(S)| can be determined in 
polynomial time. 

If a walk through n vertices in G does not avoid any vertex k, then it must 
visit all the vertices, and hence must form a Hamiltonian path. Consequently, 
the number of Hamiltonian paths from 1 to n in G equals 

n—l 

|Walk(0)| - I IJ WALK({fc})| = (-1)'^' • |Walk(S)|. 

fc=2 SCV 

Here we have used the inclusion-exclusion principle. The sum in the right hand 
side of the displayed equation is straightforward to compute by applying the 
fact discussed above. We only need to remember the partial sum of all the terms 
evaluated so far, and the space used for evaluating one term can be reused in 
evaluating the later terms. All in all, evaluating and adding up the values of 
0(2”) terms yields an 0*(2”) time and polynomial space algorithm for counting 
the number of Hamiltonian paths. The following fact is a trivial consequence of 
this: 

Fact. The Hamiltonian path problem in an n-vertex graph can be solved in 
0*(2") time and polynomial space. 

Eppstein [10] improves on this polynomial space result for Hamiltonian path 
in the special case of cubic graphs: He presents an algorithm that uses 0*(1.297”) 
time and linear space. Bax [2] and Bax & Franklin [3] have extended the 
inclusion-exclusion approach to a number of counting problems around paths 
and cycles in n-vertex graphs. For all these problems, the time complexity is 
0*(2”) and the space complexity is polynomial. 
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Open problem 7 Construct 0*(1.99") time exact algorithms for the following 
counting problems in n-vertex graphs G: 

(a) Count the number of paths between a given pair of vertices in G. 

(b) Count the number of cycles in G. 

(c) Count the number of cycles through a given vertex in G. 

(d) Count the number of cycles of a given length i in G. 

Now let us turn to some relatives of the n-city TSP. For a fixed Hamiltonian 
path from city 1 to city n and for a fixed city k, we denote by the delay of 
city k the length of the subpath between city 1 and city k. In the travelling 
repairman problem (TRP), the goal is to find a Hamiltonian path from city 1 
to city n that minimizes the sum of delays over all cities. In the precedence 
constrained travelling repairman problem (prec-TRP), the input additionally 
specifies a partial order on the cities. A Hamiltonian path is feasible, if it obeys 
the partial order constraints. 

Here is a related scheduling problem SCHED: There are n jobs l,...,n 
with processing times pi,...,p„. The jobs are partially ordered (precedence 
constrained), and if job i precedes job j in the partial order, then i must be 
processed to completion before j can begin its processing. All jobs are available 
at time 0, and job preemption is not allowed. The goal is to schedule the jobs 
on a single machine such that all precedence constraints are obeyed and such 
that the total job completion time minimized; here Gj is the time 

at which job j completes in the given schedule. SCHED is the special case of 
prec-TRP where the distances between cities i ^ j are given by d{i,j) = pj. It is 
quite straightforward to design an 0*(2”) time and 0*(2”) space dynamic pro- 
gramming algorithm for prec-TRP (and for its special cases TRP and SCHED). 

Open problem 8 

(a) Construct an 0*(1.99”) time exact algorithm for TRP or for SCHED or for 
prec-TSP. 

(b) Provide evidence in favor of or against the following claim: If there exists an 
0*(c”) time exact algorithm with c < 2 for one of the four problems TSP, 
TRP, SCHED, prec-TSP, then there exist 0*{c^) time exact algorithms for 
all four problems. 
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Abstract. When combined with high performance computational ar- 
chitectures, methods born of FPT can be used as a practical basis for 
launching systematic attacks on large-scale combinatorial problems of 
significance. Efficient sequential techniques for kernelization and highly 
parallel algorithms for branching will be discussed. The importance of 
maintaining a balanced decomposition of the search space turns out to 
be critical to achieving scalability. Applications abound, perhaps most 
notably in high-throughput computational biology. A toolchain will be 
described that transforms immense quantities of mRNA microarray data 
into instances of the clique problem, which are then solved via vertex 
cover to derive sets of putatively co-regulated genes. This makes it pos- 
sible to narrow the search for cis and trans regulatory structures on scales 
that were previously unthinkable. 
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